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Preface 


OBJECTIVES 

The  last  years  have  seen  a  thriving  research  activity  on  cooperative  control 
and  motion  coordination.  This  interest  is  motivated  by  the  growing  possi¬ 
bilities  enabled  by  robotic  networks  in  the  monitoring  of  natural  phenomena 
and  the  enhancement  of  human  capabilities  in  hazardous  and  unknown  en¬ 
vironments. 

Our  first  objective  with  this  book  is  to  present  a  coherent  introduction  to 
basic  distributed  algorithms  for  robotic  networks.  This  emerging  discipline 
sits  at  the  intersection  of  different  areas  such  as  distributed  algorithms,  par¬ 
allel  processing,  control,  and  estimation.  Our  second  objective  is  to  provide 
a  self-contained,  broad  exposition  of  the  notions  and  tools  from  these  areas 
that  are  relevant  in  cooperative  control  problems.  These  concepts  include 
graph-theoretic  notions  (connectivity,  adjacency  and  Laplacian  matrices), 
distributed  algorithms  from  computer  science  (leader  election,  basic  tree 
computations)  and  from  parallel  processing  (averaging  algorithms,  conver¬ 
gence  rates),  and  geometric  models  and  optimization  (Voronoi  partitions, 
proximity  graphs).  Our  third  objective  is  to  put  forth  a  model  for  robotic 
networks  that  helps  to  rigorously  formalize  coordination  algorithms  running 
on  them.  We  illustrate  how  computational  geometry  plays  an  important  role 
in  modeling  the  interconnection  topology  of  robotic  networks.  We  draw  on 
classical  notions  from  distributed  algorithms  to  provide  complexity  measures 
that  characterize  the  execution  of  coordination  algorithms.  Such  measures 
allow  us  to  quantify  the  algorithm  performance  and  implementation  costs. 
Our  fourth  and  last  objective  is  to  present  various  algorithms  for  coordina¬ 
tion  tasks  such  as  connectivity  maintenance,  rendezvous,  and  deployment. 
We  put  special  emphasis  on  analyzing  the  correctness  of  the  algorithms  and 
providing  measures  of  their  complexity. 

The  thematic  variety  of  the  exposition  is  also  present  in  the  proofs  of 
the  main  results  of  the  book.  The  technical  treatment  combines  control- 
theoretic  tools  such  as  Lyapunov  functions  and  invariance  principles  with 
techniques  from  computer  science  and  parallel  processing  such  as  induction 
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and  message  counting. 

INTENDED  AUDIENCE 

The  intended  audience  of  this  book  are  first-  and  second-year  graduate  stu¬ 
dents  in  control  and  robotics  from  Computer  Science,  Electrical  Engineer¬ 
ing,  Mechanical  Engineering,  and  Aerospace  Engineering.  A  familiarity  with 
basic  concepts  from  analysis,  linear  algebra,  dynamical  systems,  and  con¬ 
trol  theory  is  assumed.  The  writing  style  of  the  book  is  mathematical:  we 
have  aimed  at  being  precise  in  the  introduction  of  the  notions,  the  state¬ 
ment  of  the  results,  and  the  formal  description  of  the  algorithms.  This  is 
complemented  by  numerous  examples,  exercises,  and  a  special  effort  carried 
throughout  the  book  at  motivating  the  introduction  of  concepts  and  giving 
intuitive  explanations  behind  the  results. 

Researchers  in  the  fields  of  control  theory  and  robotics  who  are  not  aware 
of  the  literature  on  distributed  algorithms  will  also  benefit  from  the  book. 
The  book  uses  notions  with  a  clear  computer-science  flavor  such  as  syn¬ 
chronous  networks,  complexity  measures,  basic  tree  computations,  and  lin¬ 
ear  distributed  iterations,  and  integrates  them  into  the  study  of  robotic 
networks.  Likewise,  researchers  in  the  fields  of  distributed  algorithms  and 
automata  theory  who  are  not  aware  of  robotic  networks  and  distributed  con¬ 
trol  will  also  find  the  book  useful.  The  numerous  connections  that  can  be 
drawn  between  the  classical  study  of  distributed  algorithms  and  the  present 
book  provide  a  friendly  roadmap  to  step  into  the  field  of  controlled  coordi¬ 
nation  of  robotic  networks. 

BOOK  OUTLINE 

Chapter  1  presents  a  broad  introduction  to  distributed  algorithms  on  syn¬ 
chronous  networks.  We  start  by  presenting  basic  matrix  notions  and  a 
primer  on  graph  theory  that  makes  special  emphasis  on  linear  algebraic 
aspects  such  as  adjacency  and  Laplacian  matrices.  After  this,  we  introduce 
the  notion  of  synchronous  networks,  and  present  time,  communication,  and 
space  complexity  notions.  We  examine  these  notions  in  basic  algorithms 
such  as  broadcast,  tree  computation,  and  leader  election.  The  chapter  ends 
with  a  thorough  treatment  of  linear  iterations  and  averaging  algorithms. 

Chapter  2  presents  basic  geometric  notions  that  are  relevant  in  motion  co¬ 
ordination.  Robotic  networks  have  a  spatial  component  which  is  not  always 
present  in  synchronous  networks  as  studied  in  computer  science.  Geometric 
objects  such  as  polytopes,  Voronoi  partitions,  and  geometric  centers  play 
an  important  role  in  modeling  the  interaction  of  robotic  networks  with  the 
physical  environment.  Proximity  graphs  allow  us  to  rigorously  formalize  the 
interconnection  topology  of  a  network  of  robotic  agents,  and  characterize 
the  spatially-distributed  character  of  coordination  algorithms.  This  notion 
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is  a  natural  translation  of  the  notion  of  distributed  algorithms  treated  in 
the  previous  chapter.  The  chapter  concludes  with  a  detailed  discussion  on 
concepts  from  geometric  optimization  and  multicenter  functions. 

Chapter  3  introduces  a  model  for  a  group  of  robots  that  synchronously 
communicate/sense  locally,  process  information,  and  move.  We  describe  the 
physical  components  of  the  robotic  network  and  introduce  a  formal  notion 
of  motion  coordination  algorithm  as  a  control  and  communication  law.  Gen¬ 
eralizing  the  notions  introduced  in  Chapter  1,  we  introduce  the  notion  of 
task  and  of  time,  communication,  and  space  complexity.  We  illustrate  these 
concepts  by  means  of  a  simple  and  insightful  example  of  a  group  of  robots 
moving  on  a  circle. 

Chapter  4  analyzes  in  detail  two  coordination  tasks:  connectivity  main¬ 
tenance  and  rendezvous.  The  objective  of  “connectivity  maintenance”  is  to 
establish  local  rules  that  allow  agents  to  move  without  losing  the  connec¬ 
tivity  of  the  overall  networks.  The  objective  of  “rendezvous”  is  to  establish 
local  rules  that  allow  agents  to  agree  on  a  common  spatial  location.  We 
present  coordination  algorithms  that  achieve  these  tasks,  making  use  of  the 
geometric  concepts  introduced  in  the  previous  chapters.  Furthermore,  we 
provide  results  on  the  correctness  and  complexity  of  these  algorithms. 

Chapter  5  considers  deployment  problems.  The  “deployment  problem” 
objective  is  to  establish  local  rules  that  allow  agents  to  achieve  optimal 
network  configurations  in  an  environment  of  interest.  Here,  optimality  is 
defined  using  the  multicenter  functions  from  geometric  optimization  intro¬ 
duced  in  Chapter  2.  We  present  coordination  algorithms  that  achieve  these 
tasks,  characterizing  their  correctness  and  complexity. 

Chapter  6  has  a  dual  purpose.  First,  we  introduce  an  event-driven  control 
and  communication  law,  in  which  computation  and  communication  actions 
are  triggered  by  asynchronous  events,  rather  than  taking  place  on  a  periodic 
schedule.  Second,  we  consider  a  boundary  tracking  problem  and  propose 
an  “estimation  and  balancing”  algorithm  that  allows  a  robotic  network  to 
monitor  a  moving  boundary  efficiently. 

The  reader  will  note  that,  as  the  discussion  progresses,  the  selection  of 
topics  emphasizes  problems  in  which  we  have  been  directly  involved.  There 
are  exciting  topics  that  have  been  considered  in  the  literature  and  are  not 
presented  here  in  depth,  albeit  we  briefly  discuss  a  number  of  them  through¬ 
out  the  exposition.  In  this,  our  first  effort,  we  decided  to  tackle  the  problems 
we  knew  better,  postponing  the  rest  for  the  future.  We  hope  the  reader  will 
appreciate  the  result  and  share,  while  reading  it,  some  of  the  fun  we  had 
writing  it. 
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HOW  TO  USE  THIS  BOOK  AS  A  TEXT 

Our  experience  and  opinion  is  that  this  text  can  be  used  for  a  quarter-  or 
semester-long  course  on  “Distributed  Control”  or  on  “Robotic  Networks.” 
Such  a  course  could  be  taught  in  an  Engineering  or  in  a  Computer  Science 
department.  We  taught  such  a  course  at  our  respective  institutions  over  a 
10  weeks,  3  hours  a  week,  period,  skipping  some  material  and  some  proofs 
(e.g.,  skipping  combinatorial  optimization  in  Chapter  1,  some  of  the  multi¬ 
center  functions  and  the  nonconvex  geometry  treatment  in  Chapter  2,  and 
the  relative-sensing  model  in  Chapter  3).  With  proofs  and  more  complete 
treatment,  we  estimate  the  material  might  require  45  hours  of  lecture  time. 

Finally,  the  complete  latest  version  of  the  manuscript  with  supplementary 
material,  such  as  slides  and  software,  is  freely  available  on  the  internet  at: 

http : // coordinationbook . info 
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Chapter  One 

An  introduction  to  distributed  algorithms 


Graph  theory,  distributed  algorithms,  and  linear  distributed  algorithms  are 
a  fascinating  scientific  subject.  In  this  chapter  we  provide  a  broad  introduc¬ 
tion  to  distributed  algorithms  by  reviewing  some  preliminary  graphical  con¬ 
cepts  and  by  studying  some  simple  algorithms.  We  begin  the  chapter  with 
one  section  introducing  some  basic  notation  and  another  section  stating  a 
few  useful  facts  from  matrix  theory,  dynamical  systems,  and  convergence 
theorems  based  on  invariance  principles.  In  the  third  section  of  the  chapter, 
we  provide  a  primer  on  graph  theory  with  a  particular  emphasis  on  alge¬ 
braic  aspects,  such  as  the  properties  of  adjacency  and  Laplacian  matrices 
associated  to  a  weighted  digraph.  In  the  next  section  of  the  chapter,  we  in¬ 
troduce  the  notion  of  synchronous  network  and  of  distributed  algorithm.  We 
state  various  complexity  notions  and  study  them  in  simple  example  prob¬ 
lems  such  as  the  broadcast  problem,  the  tree  computation  problem,  and  the 
leader  election  problem.  In  the  fifth  section  of  the  chapter,  we  discuss  linear 
distributed  algorithms.  We  focus  on  linear  algorithms  defined  by  sequences 
of  stochastic  matrices  and  review  the  results  on  their  convergence  proper¬ 
ties.  We  end  the  chapter  with  three  sections  on,  respectively,  bibliographical 
notes,  proofs  of  the  results  presented  in  the  chapter,  and  exercises. 

1.1  ELEMENTARY  CONCEPTS  AND  NOTATION 

1.1.1  Sets  and  maps 

We  assume  the  reader  is  familiar  with  basic  notions  from  topology,  such  as 
the  notions  of  open,  closed,  bounded  and  compact  sets.  In  this  section  we 
just  introduce  some  basic  notation.  We  let  x  £  S  denote  a  point  x  belonging 
to  a  set  S.  If  S  is  finite,  we  let  \S\  denote  the  number  of  its  elements.  For 
a  set  S,  we  let  P(S)  and  F(5)  denote  the  collection  of  subsets  of  S  and  the 
collection  of  finite  subsets  of  S,  respectively.  The  empty  set  is  denoted  by 
0.  The  interior  and  the  boundary  of  a  set  S  are  denoted  by  int(S)  and  dS, 
respectively.  If  A  is  a  subset  of  or  equal  to  S,  then  we  write  A  C  S.  If  A  is 
a  strict  subset  of  S,  then  we  write  ACS.  We  describe  subsets  of  S  defined 
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by  specific  conditions  via  the  notation 

{x  E  S  |  condition(s)  on  x}. 

Given  two  sets  Si  and  S2,  we  let  S1US2,  S\  0  S2 ,  and  S 1  x  S2  denote 
the  union,  intersection  and  Cartesian  product  of  Si  and  S2,  respectively. 
Given  a  collection  of  sets  {Sa}aeA  indexed  by  a  set  A,  we  interchangeably 
denote  their  Cartesian  product  by  Sa  or  by  ]}[{iSa  |  a  E  ^4}.  We 

adopt  analogous  notations  for  union  and  intersection.  We  denote  by  Sn  the 
Cartesian  product  of  n  copies  of  the  same  S.  The  diagonal  set  diag(5n)  of 
Sn  is  given  by  diag(Sn)  =  {(s, . . . ,  s)  £5"  |  s  €  5}.  The  set  Si  \  S2  contains 
all  points  in  Si  that  do  not  belong  to  S2. 

We  let  N  and  Z>o  denote  the  set  of  natural  numbers  and  of  non-negative 
integers,  respectively.  We  let  M,  M>o,  R>o  and  C  denote  the  set  of  real 
numbers,  strictly  positive  real  numbers,  non-negative  real  numbers,  and 
complex  numbers,  respectively.  The  sets  Md,  Cd,  and  Sd  C  Rd+1  are  the 
d-dimensional  Euclidean  space,  the  d-dimensional  complex  space,  and  the 
d-dimensional  sphere,  respectively.  The  tangent  space  of  Mrf,  denoted  by 
TMd,  is  the  set  of  all  vectors  tangent  to  Md.  Note  that  TMrf  can  be  identified 
with  xMd  by  mapping  a  vector  v  tangent  to  at  x  E  to  the  pair  (x,  v ). 
Likewise,  T§>d  is  the  set  of  all  vectors  tangent  to  E>d,  and  can  be  identified  with 
§d  xRd.  The  Euclidean  space  Rd  contains  the  vectors  0^  =  (0, . . . ,  0),  1^  = 
(1, . . . ,  1)  and  the  standard  basis  ei  =  (1, 0, . . . ,  0), . . . ,  =  (0, . . . ,  0, 1). 

Given  a  <  b,  we  let  [a,  b]  and  ]a,  b[  denote  the  closed  interval  and  the  open 
interval  between  a  and  6,  respectively. 

Given  two  sets  S  and  T,  we  let  /  :  S  — >  T  denote  a  map  from  S  to  T, 
i.e. ,  a  unique  way  of  associating  an  element  of  T  to  an  element  of  S.  Given 
/  :  S  — >  T  and  Si  C  S,  we  let  /(Si)  denote  the  image  set  {/(s)  |  s  E  Si}. 
Given  /  :  S  T  and  g  :  T  ->  U,  we  let  /  o  g  :  S  -4  U,  f  o  g  (s)  =  f(g(s)), 
denote  the  composition  of  /  and  g.  The  map  idg  :  S  — >  S  is  the  identity 
map  on  S.  Given  /  :  S  — >  R,  the  support  of  /  is  the  set  of  elements  s  such 
that  f(s)  /  0.  The  indicator  function  I5  :  S  — >  R  associated  with  S  is  given 
by  1  s{q)  =  1  if  q  £  S,  and  1s(q)  =  0  if  q  0  S.  Given  two  sets  S  and  T, 
a  set-valued  map,  denoted  by  h  :  S  T,  associates  to  an  element  of  S  a 
subset  of  T.  Given  a  map  /  :  S  — >  T,  the  inverse  map  f  :  T  ^  S  is 
defined  by 

f  L{t)  =  is  £  S  |  f(s)  =  t}. 

If  /  is  a  real-valued  function,  then  /_1(x),  for  any  x  E  R,  is  a  level  set  of  /. 

The  projection  of  a  point  q  £  M.d  onto  a  set  S  C  is  defined  by  the 
set-valued  map  projs  :  S.  This  map  assigns  to  every  point  q  E  the 

set  projg^)  =  {p  E  S  \  dist2(g,p)  =  dist2(<?,  51)}.  If  S  is  a  closed  set,  then 
proj^q)  7^  0  for  any  q  E  W1. 
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Finally,  we  introduce  the  so-called  Bachmann- Landau  symbols.  For  f,g  : 
N  — »  R,  we  say  that  /  £  O(g)  (respectively,  /  £  Ll(g))  if  there  exist  no  £  N 
and  K  £  R>o  (respectively,  k  £  R>o)  such  that  |/(n)|  <  K\g(n)\  for  all 
n  >  no  (respectively,  |/(n)|  >  k\g(n)\  for  all  n  >  no).  If  /  £  0(g)  and 
/  £  Ll(g),  then  we  use  the  notation  /  £  0(g). 


1.1.2  Curves 

In  a  topological  space  X ,  a  (continuous)  curve  C  is  the  image  of  a  continuous 
map  7  :  [a,  b]  — >  X.  The  map  7  is  called  a  parameterization  of  C.  We  usually 
identify  a  parameterization  with  the  curve  it  defines.  Note  that,  without  loss 
of  generality,  we  can  take  a  =  0  and  6  =  1.  A  curve  connects  two  points  p 
and  q  if  7(0)  =  p  and  7(1)  =  q.  The  length  of  a  continuous  and  piecewise 
continuously  differentiable  curve  7  is 

length(7)  =  [  ^(t^dt. 

Jo 

A  curve  7  :  [0, 1]  — >  X  is  not  self-intersecting  if  7  is  injective  on  (0, 1).  A 
curve  is  closed  if  7(0)  =  7(1).  For  a  not  self-intersecting  and  closed  curve, 
we  define  the  inward  unit  normal  vector  rijn7  and  the  outward  unit  normal 
vector  nout)7. 

Consider  a  curve  7  :  [0, 1]  — >  M.d  in  Euclidean  space.  Define  the  arg-length 
parameter  by 

7arc (t)  =  [  W^u^du. 

Jo 

Note  that  as  the  parameter  t  varies  in  [0, 1],  the  arc-length  parameter  7arc(f) 
varies  in  [0,  length^)].  The  arc-length  parameterization  of  the  curve  is  the 
map  7arc  :  [0,  length^)]  — >  R3  defined  by  the  equation  7arc(7arc(t))  =  7 (*)■ 
Note  that  llv^lh  =  1-  With  a  slight  abuse  of  notation,  we  will  often  drop 
the  subindex  arc  and  denote  the  arc- length  parameterization  by  7  too. 

For  closed,  not  self-intersecting  curves  in  the  plane,  we  define  the  signed 
and  absolute  curvatures  as  follows.  Consider  the  (counterclockwise)  arc- 
length  parameterization  7  :  [0,  length(7)]  — >  M2.  For  each  7arc  £  [0,  length^)], 
denote  by  7/(7arc)  and  nout,7(7arc)  the  tangent  vector  and  the  outward  unit 
normal  vector  to  the  curve  at  7(7^) •  With  these  notations,  the  signed 
curvature  Ksigneci  :  [0,  length^)]  — >  R>o  is  defined  by  requiring  it  satisfies 

7  (Tare)  =  ^signed (7arc)  nout, 7 (7arc); 
nout,7  (Tare)  =  —  Ksigned  (Tare  )  T  (Tare)- 
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The  (absolute)  curvature  «abs  :  [0,  length(7)]  — *  M>o  and  the  radius  of  cur¬ 
vature  p  :  [0,  length(7)]  — >  M>o  of  the  curve  are  then  defined  by 

^abs  (Tare  )  =  I  ^signed  (Tare  )  |  j 
P(Tarc)  =  |  ^signed  (Tare)  | 


A  set  S  C  X  is  path  connected  if  any  two  points  in  S  can  be  joined  by 
a  curve.  A  set  S  C  X  is  simply  connected  if  it  is  path  connected  and  any 
non  self-intersecting  closed  curve  can  be  continuously  deformed  to  a  point 
in  the  set,  i.e. ,  for  any  injective  continuous  map  y  :  [0, 1]  — ; ►  S  that  satisfies 
y(0)  =  7(1),  there  exist  p  E  S  and  a  continuous  map  H  :  [0, 1]  x  [0, 1]  — ►  S 
such  that  H(t,  0)  =  7 (t)  and  H(t,  1)  =  p  for  all  t  E  [0,1].  Informally,  a 
simply  connected  set  is  a  set  that  consists  of  a  single  piece  and  does  not 
have  any  holes. 


1.1.3  Distance  functions 

A  function  dist  :  S  x  S  —>  M>o  defines  a  distance  on  a  set  S  if  it  satisfies: 
(i)  dist(x,y)  =  0  if  and  only  if  x  =  y,  (ii)  dist(x,y)  =  dist (y,x)  for  all 
x,y  E  S,  and  (iii)  dist(x,y)  <  dist(x,z)  +  dist(z,  y),  for  all  x,y,z  E  S.  The 
pair  ( S ,  dist)  is  usually  called  a  metric  space. 

Some  relevant  examples  of  distance  functions  include  the  following: 


Lp-distance  on  Wl.  For  p  E  [l,+oo[,  consider  the  Lp-norm  on  defined 
by  ||x||p  =  (Y)i= 1  | Xi\p)l/p .  For  p  =  +00,  consider  the  L°°-norm  on 
M.d  defined  by  ||a;]|c»  =  max^n  ^i  \xf\.  Any  of  these  norms  defines 
naturally  a  Lp -distance  in  Wl  by  distp(a:,  y)  =  ||y  —  x\\p.  In  partic¬ 
ular,  the  most  widely  used  is  the  Euclidean  distance,  corresponding 
to  p  =  2.  Unless  otherwise  noted,  it  is  always  understood  that  M.d  is 
endowed  with  this  notion  of  distance.  We  will  also  use  the  L1-  and 
the  L°°-distances.  Finally,  it  is  convenient  to  define  the  norm  ||z||c  of 
a  complex  number  z  E  C  to  be  the  Euclidean  norm  of  2  regarded  as  a 
vector  in  R2; 

Geodesic  distance  on  §d.  Another  example  is  the  notion  of  geodesic  dis¬ 
tance  on  §d.  This  is  defined  as  follows.  For  x,y  E  §d,  dists(x,y)  is 
the  length  of  the  shortest  curve  in  connecting  x  and  y.  We  will 
use  this  notion  of  distance  in  dimensions  d  =  1  and  d  =  2.  On  the 
unit  circle  S1,  by  convention,  let  us  define  positions  as  angles  measured 
counterclockwise  from  the  positive  horizontal  axis.  Then,  the  geodesic 
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distance  can  be  expressed  as 

distg(x,  y)  =  min{distc(x,  y),distcc(x,y)},  a^yES1, 

where  distc(x,y)  =  (x  —  y)  mod  2-jt  is  the  clockwise  distance  and 
distcc(£,y)  =  (y  —  x)  mod  2-7T  is  the  counterclockwise  distance.  Here 
the  clockwise  distance  between  two  angles  is  the  path  length  from  an 
angle  to  the  other  traveling  clockwise,  and  x  mod  2-/r  is  the  remainder 
of  the  division  of  x  by  27t.  On  the  sphere  §2,  the  geodesic  distance  can 
be  computed  as  follows.  Given  x,  y  E  §2,  one  considers  the  great  circle 
determined  by  x  and  y.  Then,  the  geodesic  distance  between  x  and  y 
is  exactly  the  length  of  the  shortest  arc  in  the  great  circle  connecting 
x  and  y; 

Cartesian  product  distance  on  Rdl  x  Ed2.  Consider  M.dl  endowed  with 
an  Lp-distance,  p  E  [l,+oo],  and  Ed2  endowed  with  the  geodesic  dis¬ 
tance.  Then,  one  can  define  the  Cartesian  product  distance  on  Rrfl  x 
Ed2  by  (distp,dists)((xi,yi),  (x2ly2))  =  distp(xi,  x2)  +  dist9(yi,  y2)  for 
(xi,yi),  (x2,y2)  E  R^1  x  Ed2.  Unless  otherwise  noted,  it  is  always  un¬ 
derstood  that  Rdl  x  S^2  is  endowed  with  the  Cartesian  product  distance 
(dist2,  dist9). 


Given  a  metric  space  (5,  dist),  the  open  and  closed  ball  of  center  x  E  S 
and  radius  e  E  R>o  are  defined  by,  respectively, 

B(x,s)  =  {y  E  S  |  dist(x,y)  <  e}, 

B(x,e)  =  {y  E  S  |  dist(a;,y)  <  e}. 

Consider  a  point  and  a  set  S  C  X.  A  neighborhood  of  a  point  x  E  X 

is  a  subset  of  X  that  contains  an  open  ball  centered  at  x.  A  neighborhood  of 
a  set  Y  C  X  is  a  subset  of  X  that,  for  each  point  y  E  Y,  contains  an  open 
ball  centered  at  y. 

The  open  lune  associated  to  x,  y  E  S  is  B{x,  dist(x,  y))  n  B(y,  dist(x,  y)). 
These  notions  are  illustrated  in  Figure  1.1  for  the  Euclidean  distance  on  the 
plane. 

The  distance  between  a  point  x  E  S  and  a  set  W  E  R(S')  is  the  inhmum 
of  all  distances  between  x  and  each  of  the  points  in  W.  Formally,  we  set 

dist(x,!U)  =  inf{dist(x,  y)  |  y  E  W}. 

The  diameter  of  a  set  is  the  maximum  distance  between  any  two  points  in 
the  set.  Formally,  we  set  diam(5)  =  sup{dist(x,  y)  |  x,  y  E  5}.  With  a  slight 
abuse  of  notation,  we  often  use  diam(P)  to  denote  diam({pi, . . .  ,pn})  for 
P={Pl,...,Pn)  E  ( Rdr . 
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Figure  1.1  Open  balls  (dashed  lines),  closed  ball  (solid  line),  and  open  lune  for  the  Eu¬ 
clidean  distance  on  the  plane. 

1.2  MATRIX  THEORY 

Here  we  present  basic  notions  and  results  about  matrix  theory,  following  the 
treatments  in  [Horn  and  Johnson,  1985]  and  [Meyer,  2001].  We  let  Wixm 
and  Cnxm  denote  the  set  of  n  x  m  real  and  complex  matrices.  Given  a  real 
matrix  A  and  a  complex  matrix  U,  we  let  AT  and  U*  denote  the  transpose 
of  A  and  the  conjugate  transpose  matrix  of  U ,  respectively.  We  let  In 
denote  the  n  x  n  identity  matrix.  For  a  square  matrix  A,  we  write  A  >  0, 
resp.  A  >  0,  if  A  is  symmetric  positive  definite,  resp.  symmetric  positive 
semidehnite.  For  a  real  matrix  A,  we  let  kernel(A)  and  rank(A)  denote  the 
kernel  and  rank  of  A,  respectively.  Given  a  vector  v,  we  let  diag(w)  denote 
the  square  matrix  whose  diagonal  elements  are  equal  to  the  component  v 
and  whose  off-diagonal  elements  are  zero. 


1.2.1  Matrix  sets 


A  matrix  A  £  Mnxn  with  entries  aij,  i,j  £  {1, . . . ,  n},  is 

(i)  orthogonal  if  AAT  =  In ,  and  is  special  orthogonal  if  it  is  orthogonal 
with  det(.A)  =  +1.  The  set  of  orthogonal  matrices  is  a  group;1 

(ii)  nonnegative  (resp.,  positive )  if  all  its  entries  are  nonnegative  (resp., 
positive); 

(iii)  row-stochastic  (or  stochastic  for  brevity)  if  it  is  nonnegative  and 
J2j=iaij  =  1)  for  all  i  £  {l,...,n};  in  other  words,  A  is  row- 
stochastic  if 

41  —  1  • 

-ri-Ln  —  J-ri) 

(iv)  doubly  stochastic  if  it  is  row-stochastic  and  column-stochastic,  where 


1A  set  G  with  a  binary  operation,  denoted  by  G  x  G  3  (a,  b)  >  a  *  b  G  G,  is  a  group  if  (i) 
a  *  (6  ★  c)  =  (a*  b)  *  c  for  all  a,b,c  G  G  (associativity  property);  (ii)  there  exists  e  G  G  such  that 
a-ke  =  e-ka  =  a  for  all  a  G  G  (existence  of  an  identity  element);  and  (iii)  there  exists  a-1  G  G 
such  that  a  *  a-1  =  a-1  *  a  =  e  for  all  a  G  G  (existence  of  inverse  elements). 
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we  say  that  A  is  column- stochastic  if  =  1%; 

(v)  normal  if  AT A  =  AAT\ 

(vi)  a  permutation  matrix  if  A  has  precisely  one  entry  equal  to  one  in 
each  row,  one  entry  equal  to  one  in  each  column,  and  all  other 
entries  equal  to  zero.  The  set  of  permutation  matrices  is  a  group; 
and 

(vii)  irreducible  if,  for  any  nontrivial  partition  J  Li  K  of  the  index  set 
{1, . . . ,  n},  there  exists  j  €  J  and  k  E  K  such  that  /  0. 

Remark  1.1  (Properties  of  irreducible  matrices).  The  property  of 
irreducibility  depends  only  upon  the  patterns  of  zeros  and  nonzero  elements 
of  the  matrix.  Also,  note  the  following  equivalent  definition  of  irreducibility. 
A  matrix  A  E  Rnx?l  is  irreducible  if  it  is  not  reducible,  and  is  reducible  if 
either 

(i)  n  =  1  and  A  =  0;  or 

(ii)  there  exists  a  permutation  matrix  P  E  Mnxn  and  a  number  r  E 

{ 1 , . . .  ,77.  —  1 }  such  that  PT AP  is  block  upper  triangular  with  diag¬ 
onal  blocks  of  dimensions  r  x  r  and  (n  —  r)  x  (n  —  r).  • 

The  scalars  /j,i,  ■  ■  ■  ,Hk  are  convex  combination  coefficients  if  >  0,  for 
i  E  {l,...,/c},  and  Yli=i  IP  =  1-  (Each  row  of  a  row-stochastic  matrix 
contains  convex  combination  coefficients.)  A  convex  combination  of  vectors 
is  a  linear  combination  of  the  vectors  with  convex  combination  coefficients. 
A  subset  U  of  a  vector  space  V  is  convex  if  the  convex  combination  of  any 
two  elements  of  U  takes  value  in  U.  For  example,  the  set  of  stochastic 
matrices  and  the  set  of  doubly  stochastic  matrices  are  convex. 

Theorem  1.2  (Birkhoff— Von  Neumann).  A  square  matrix  is  doubly 
stochastic  if  and  only  if  it  is  a  convex  combination  of  permutation  matri¬ 
ces. 

Next,  we  review  two  families  of  relevant  matrices  with  useful  properties. 
Toeplitz  matrices  are  square  matrices  with  equal  entries  along  each  diagonal 
parallel  to  the  main  diagonal.  In  other  words,  a  Toeplitz  matrix  is  a  matrix 
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of  the  form 


to  1 1 

t- 1  to  t\ 

t-  i  to 

f_i 

t— n+2 

t—n+l  t— n+2 


tn—2  tj j—1 

tn—2 

t\ 

to  ti 

t-1  to  tl 

'•  t_i  to  tl 

t_i  to 


An  n  x  n  Toeplitz  matrix  is  determined  by  its  first  row  and  column,  hence 
by  2  re  —  1  scalars. 


Circulant  matrices  are  square  Toeplitz  matrices  where  each  two  subse¬ 
quent  row  vectors  Vi  and  Vi+i  have  the  following  two  properties:  the  last 
entry  of  Vi  is  the  first  entry  of  Vi+\  and  the  first  (n  —  1)  entries  of  Vi  are  the 
second  (n—  1)  entries  of  Vi+  \ .  In  other  words,  a  circulant  matrix  is  a  matrix 
of  the  form 


Co 

Cl 

Cn—  2 

Cn—  1 

Cn— 1 

co 

Cl 

Cn— 2 

Cn—  1 

Co 

Cl 

Cn—  1 

Co 

Cl 

Cn—  1 

co 

Cl 

C2 

Cn— 1 

co 

Cl 

Cl 

C2 

Cn—  1 

CO  . 

and,  therefore,  it  is  determined  by  its  first  row. 


1.2.2  Eigenvalues,  singular  values,  and  induced  norms 

We  require  the  reader  to  be  familiar  with  the  notion  of  eigenvalue  and  of 
simple  eigenvalue  (i.e.,  an  eigenvalue  with  algebraic  and  geometric  multi¬ 
plicity  equal  to  1).  The  set  of  eigenvalues  of  a  matrix  A  E  Mnxn  is  called  its 
spectrum  and  is  denoted  by  spec(A)  C  C.  The  singular  values  of  the  matrix 
A  E  MTlxn  are  the  positive  square  roots  of  the  eigenvalues  of  AT A. 

We  begin  with  a  well-known  property  of  the  spectrum  of  a  matrix. 
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Theorem  1.3  (Gersgorin  Disks).  Let  A  be  an  n  x  n  matrix.  Then 

n 

spec(A)  C  |J  jz  €  C  |  \\z  -  au\\c  <  y  |ay|}. 

ie{l,...,n}  3=1  ,j^i 

Next,  we  review  a  few  facts  about  normal  matrices  and  their  singular 
values. 

Lemma  1.4  (Normal  matrices).  For  a  matrix  A  £  Mnxn,  the  following 
statements  are  equivalent: 

(i)  A  is  normal; 

(ii)  A  has  a  complete  orthonormal  set  of  eigenvectors;  and 

(Hi)  A  is  unitarily  similar  to  a  diagonal  matrix,  i.e.,  there  exists  a  uni¬ 
tary2  matrix  U  such  that  U*AU  is  diagonal. 

Lemma  1.5  (The  singular  values  of  a  normal  matrix).  If  a  normal 
matrix  has  eigenvalues  { Ai , . . . ,  An},  then  its  singular  values  are  {|Ai|, . . . ,  |An|}. 

It  is  well  known  that  real  symmetric  matrices  are  normal,  are  diagonaliz- 
able  by  orthogonal  matrices,  and  have  real  eigenvalues.  Additionally,  circu- 
lant  matrices  are  normal. 

We  conclude  defining  the  notion  of  induced  norm  of  a  matrix.  For  p  £ 
NU{oo},  the  p-induced  norm  of  A  £  M.nxn  is 

||A||p  =  max{||Ax||p  |  ||£c||p  =  1}. 

One  can  see  that 

n  n 

1=  max  y;  | ajj | ,  ||A||oo  =  max  y  \ajj\, 

je{  l,~,n}~{  *e{l,...,n}“ 

1 1 ^4 1| 2  =  maxjer  |  a  is  a  singular  value  of  A}. 


1.2.3  Spectral  radius  and  convergent  matrices 

The  spectral  radius  of  a  matrix  A  £  Mnxn  is 

p(A)  =  max{||A||c  |  A  £  spec(A)}. 

In  other  words,  p(A)  is  the  radius  of  the  smallest  disk  centered  at  the  origin 
that  contains  the  spectrum  of  A. 

2 A  complex  matrix  U  S  Cnxn  is  unitary  if  C/_1  =  U* . 

9 


“Distributed  Control  of  Robotic  Networks”  by  F.  Bullo,  J.  Cortes  and  S.  Martinez 
Copyright  ©  2006-2008.  Manuscript  under  contract.  This  version:  October  27,  2008 


DCRN  October  27,  2008 


Lemma  1.6  (Induced  norms  and  spectral  radius).  For  any  square 
matrix  A  and  in  any  norm  p  G  N  U{oo},  p(A)  <  ||A||p. 

We  will  often  deal  with  matrices  with  an  eigenvalue  equal  to  1  and  all 
other  eigenvalues  strictly  inside  the  unit  disk.  Accordingly,  we  generalize  the 
notion  of  spectral  radius  as  follows.  For  a  square  matrix  A  with  p(A)  =  1, 
we  define  the  essential  spectral  radius 

pess(A)  =  max{||A||c  |  A  G  spec(A)  \  {1}}.  (1.2.1) 


Next,  we  consider  matrices  with  useful  convergence  properties. 

Definition  1.7  (Convergent  and  semi-convergent  matrices).  A  ma¬ 
trix  A  G  MnXTl  is 

(i)  semi-convergent  if  lirri£_>+00  A(  exists;  and 

(ii)  convergent  if  it  is  semi-convergent  and  lim^+oc  A1  =  0. 

These  two  notions  are  characterized  as  follows. 

Lemma  1.8  (Convergent  and  semi-convergent  matrices).  The  square 
matrix  A  is  convergent  if  and  only  if  p(A)  <  1.  Furthermore,  A  is  semi- 
convergent  if  and  only  if  the  following  three  properties  hold: 

(i)  p{A)  <  1; 

(ii)  /9ess  (A)  <  1,  that  is,  1  is  an  eigenvalue  and  1  is  the  only  eigenvalue 
on  the  unit  circle;  and 

(in)  the  eigenvalue  1  is  semisimple,  i.e.,  it  has  equal  algebraic  and  geo¬ 
metric  multiplicity  (possibly  larger  than  one). 


In  other  words,  A  is  semi-convergent  if  and  only  if  there  exists  a  nonsin¬ 
gular  matrix  T  such  that 


A  =  T 


h 

0 


0 

B 


T 


-i 


where  B  G  M(n-fc)x(n-fc)  is  convergent,  that  is,  p(B)  <  1.  With  this  notation, 
we  have  pess(A)  =  p(B)  and  the  algebraic  and  geometric  multiplicity  of  the 
eigenvalue  1  is  k. 


1.2.4  Perron-Frobenius  theory 

Positive  and  nonnegative  matrices  have  useful  spectral  properties.  The  first 
statement  in  the  following  theorem  amounts  to  the  original  Perron’s  The- 
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orem  for  positive  matrices;  the  second  statement  is  the  extension  due  to 
Frobenius  for  nonnegative  matrices. 


Theorem  1.9  (Perron- Frobenius).  If  the  square  matrix  A  is  positive, 
then 

(i)  p(A)  >  0; 

(ii)  p(A)  is  an  eigenvalue,  it  is  simple,  and  p(A)  is  strictly  larger  than 
the  magnitude  of  any  other  eigenvalue;  and 

(Hi)  p(A)  has  an  eigenvector  with  positive  components. 

Furthermore,  if  the  square  matrix  A  is  nonnegative  and  irreducible,  then 

(i)  p(A)  >  0; 

(ii)  p(A)  is  an  eigenvalue,  and  it  is  simple;  and 

(Hi)  p(A)  has  an  eigenvector  with  positive  components. 


Note  that,  in  general,  the  spectral  radius  of  a  nonnegative  irreducible 
matrix  does  not  need  to  be  the  only  eigenvalue  of  maximum  magnitude.  For 

^  1  has  eigenvalues  {1,-1}.  It  is  useful,  therefore, 


example,  the  matrix 


1  0 


to  introduce  a  sharper  characterization  of  nonnegative  irreducible  matrices. 


Definition  1.10  (Primitive  matrix).  A  nonnegative  square  matrix  A  is 
primitive  if  there  exists  k  £  N  such  that  Ak  is  positive.  • 


It  is  easy  to  see  that  if  a  matrix  is  reducible,  then  it  cannot  be  primitive; 
or  in  other  words,  if  A  is  primitive,  then  it  must  be  irreducible.  The  second 
part  of  the  Perron-Frobenius  Theorem  1.9  can  now  be  sharpened  as  follows. 

Theorem  1.11  (Perron-Frobenius  for  primitive  matrices).  If  a  non¬ 
negative  matrix  is  primitive,  then  its  spectral  radius  is  its  only  eigenvalue  of 
maximum  magnitude. 


We  conclude  this  section  by  noting  the  following  convergence  properties:  If 
A  is  positive  or  A  is  nonnegative  and  primitive,  then  Lemma  1.8  guarantees 
that  p{A)~lA  is  semi-convergent. 


1.3  DYNAMICAL  SYSTEMS  AND  STABILITY  THEORY 

In  this  section  we  introduce  some  basic  concepts  about  dynamical  and  con¬ 
trol  systems;  e.g.,  see  [Sontag,  1998,  Khalil,  2002],  We  discuss  stability  and 
attractivity  notions,  and  give  fairly  general  versions  of  the  invariance  princi¬ 
ple.  We  conclude  with  a  treatment  of  set-valued  systems  and  time-dependent 
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systems. 


1.3.1  State  machines  and  dynamical  systems 

Here,  we  introduce  three  classes  of  dynamical  and  control  systems:  (i)  state 
machines  or  discrete-time  discrete-space  dynamical  systems;  (ii)  discrete¬ 
time  continuous-space  control  systems;  and  (iii)  continuous-time  continuous- 
space  control  systems. 

We  begin  with  our  specific  definition  of  state  machine.  A  ( deterministic , 
finite)  state  machine  is  a  tuple  ( A ,  U,X o,  /),  where  A  is  a  finite  set  called 
the  state  space,  U  is  a  finite  set  called  the  input  space,  Xq  C  X  is  the  set 
of  allowable  initial  states,  and  /  :  X  x  U  — ■>  X  is  the  evolution  map.  Given 
an  input  sequence  u  :  Z>o  — >  U,  the  state  machine  evolution  x  :  Z>o  — >  X 
starting  from  x(0)  E  Ao  is  given  by 

x(£  +  l )  =  f(x(£),u(£)),  l  E  Z>0. 

We  will  often  refer  to  a  state  machine  as  a  processor.  Note  that,  in  a  state 
machine,  both  the  state  and  the  input  spaces  are  finite  or  discrete.  Often 
times,  we  will  find  it  useful  to  consider  systems  that  evolve  in  continuous 
space  and  that  are  time-dependent.  Let  us  then  provide  two  additional 
definitions  in  the  next  paragraphs. 

A  (time- dependent)  discrete-time  continuous- space  control  system  is  a  tu¬ 
ple  ( X ,  U,  Xq,  /),  where  X  is  d-dimensional  space  chosen  among  M.d,  §>d,  and 
the  Cartesian  products  Rdl  xS^2,  for  some  di+d2  =  d,  U  is  a  compact  subset 
of  Mm  containing  0m,  Xq  C  X  and  /  :  Z>o  x  X  x  U  J  is  a  continuous 
map.  As  before,  the  individual  objects  X,  U,  Xq  and  /  are  termed  state 
space,  input  space,  allowable  initial  states  and  evolution  map,  respectively. 
Given  an  input  sequence  u  :  Z>o  — >  U,  the  evolution  x  :  Z>o  — »  X  of  the 
dynamical  system  starting  from  x(0)  E  Xq  is  given  by 

x(£+  1)  =  f(£,x(£),u(£)),  l  E  Z>0. 


A  (time- dependent)  continuous-time  continuous-space  control  system  is  a 
tuple  (A,  U,  Xq,/),  where  A  is  d-dimensional  space  chosen  among  Md,  8>d, 
and  the  Cartesian  products  Mdl  x  S^2,  for  some  di  +  d2  =  d,  U  is  a  compact 
subset  of  Wn  containing  0m,  Ao  C  A  and  /  :  M>o  x  X  x  U  — >  TA  is  a 
continuously  differentiable  map.  The  individual  objects  X,  U,  Xq  and  /  are 
termed  state  space,  input  space,  allowable  initial  states  and  control  vector 
field,  respectively.  Given  an  input  function  u  :  M>o  — ►  U,  the  evolution 
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x  :  M>o  — *  X  of  the  dynamical  system  starting  from  x(0)  £  Xq  is  given  by 
x(t)  =  f(t,  x(t),u(t)),  t  £  M>0. 

We  often  consider  the  case  when  the  control  vector  field  can  be  written  as 
f(t,x,u )  =  fo(t,x )  +  fa(t,  x)  ua,  for  some  continuously  differentiable 

maps  /o,/i, . . . ,  fm  :  M>o  x  X  — >  TX.  Each  of  these  individual  maps  is 
called  a  (time- dependent)  vector  field,  and  /  is  said  to  be  a  control-affine 
vector  field.  The  control  vector  field  /  is  driftless  if  f(t,x,  0m)  =  0  for  all 
x  £  X  and  t  £  M>o- 

Finally,  the  term  dynamical  system  denotes  a  control  system  that  is  not 
subject  to  any  external  control  action;  this  terminology  is  applicable  both 
in  discrete  and  continuous  time.  Furthermore,  we  will  sometimes  neglect  to 
define  a  specific  set  of  allowable  initial  states;  in  this  case  we  mean  that  any 
point  in  the  state  space  is  allowable  as  initial  condition. 


1.3.2  Stability  and  attractivity  notions 

In  this  section  we  consider  a  continuous-space  dynamical  system  ( X ,  /).  We 
first  consider  the  discrete-time  case  and  later  we  briefly  present  the  analogous 
continuous-time  case.  We  study  dynamical  systems  that  are  time-invariant. 
In  discrete-time,  a  time-invariant  system  is  simply  described  by  an  evolution 
map  of  the  form  /  :  X  — >  X. 

Definition  1.12  (Equilibrium  point).  A  point  x*  £  X  is  an  equilibrium 
point  for  the  time-invariant  dynamical  system  ( X ,  /)  if  the  constant  curve 
x  :  Z>o  — »  X ,  defined  by  x(£)  =  x*  for  all  £  £  Z>o,  is  an  evolution  of  the 
system.  • 

It  is  immediate  to  see  that  a  point  x*  is  an  equilibrium  point  if  and  only 
if  /(x*)  =  x*.  We  denote  the  set  of  equilibrium  points  of  the  dynamical 
system  by  Equil(A,  /). 

Definition  1.13  (Trajectories  and  sets).  Let  (X,f)  be  a  time-invariant 
dynamical  system  and  let  IT  be  a  subset  of  X. 

(i)  The  set  W  is  positively  invariant  for  (A,  /)  if  each  evolution  with 
initial  condition  in  IT  remains  in  IT  for  all  subsequent  times. 

(ii)  A  trajectory  x  :  Z>o  — ►  X  approaches  a  set  IT  C  X  if,  for  every 

neighborhood  Y  of  IT,  there  exists  a  time  1$  >  0  such  that  x{£) 
takes  values  in  Y  for  all  subsequent  times  £  >  £q.  In  such  a  case,  we 
write  x(£)  — >  IT  as  £  — >  +oo.  • 
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In  formal  terms,  W  is  positively  invariant  if  cc(0)  £  W  implies  x(£)  £  W 
for  all  i  £  Z>o,  where  x  :  Z>o  — >  X  is  the  evolution  of  (X,  /)  starting  from 
x(0). 

Definition  1.14  (Stability  and  attractivity  notions).  For  a  time-invariant 
dynamical  system  ( X ,  /),  a  set  S  is 

(i)  stable  if,  for  any  neighborhood  Y  of  S,  there  exists  a  neighborhood 
W  of  S  such  that  every  evolution  of  ( X ,  /)  with  initial  condition  in 
W  remains  in  Y  for  all  subsequent  times; 

(ii)  unstable  if  it  is  not  stable; 

(iii)  locally  attractive  if  there  exists  a  neighborhood  Y  of  S  such  that 
every  evolution  with  initial  condition  in  Y  approaches  the  set  S'; 
and 

(iv)  locally  asymptotically  stable  if  it  is  stable  and  locally  attractive.  • 

Remark  1.15  (Continuous-time  dynamical  systems).  It  is  straight¬ 
forward  to  extend  the  previous  definitions  to  the  setting  of  continuous-time 
continuous-space  dynamical  systems.  These  notions  are  illustrated  in  Fig¬ 
ure  1.2.  • 


Figure  1.2  Illustrations  of  stability,  asymptotic  stability,  and  instability. 


1.3.3  Invariance  principles 

Before  discussing  various  versions  of  the  invariance  principle,  we  begin  with 
a  useful  notion.  Given  a  discrete-time  time-invariant  continuous-space  dy¬ 
namical  system  (X,/)  and  a  set  W  C  X,  a  function  V  :  X  — »  R  is  non¬ 
increasing  along  f  in  W  if  V(f(x))  <  V(x)  for  all  x  £  W.  (Such  functions 
are  often  referred  to  as  Lyapunov  functions .)  In  other  words,  if  a  function 
V  is  non-increasing  along  /,  then  the  composite  function  i  i— »  V(y(£))  is 
non-increasing  for  each  evolution  y  of  the  dynamical  system  (X,/).  The 
following  theorem  exploits  this  fact  to  establish  useful  properties  of  the  evo¬ 
lutions  of  (X,  /). 
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Theorem  1.16  (LaSalle  Invariance  Principle  for  discrete-time  dy¬ 
namical  systems).  Let  (X, /)  be  a  (discrete-time  continuous-space)  time- 
independent  dynamical  system.  Assume  that 

(i)  there  exists  a  set  W  C  X  that  is  positively  invariant  for  (X,  /); 

(ii)  there  exists  a  function  V  :  X  — >  M  that  is  non-increasing  along  f 
on  W; 

(in)  all  evolutions  of  ( X,f )  with  initial  conditions  in  W  are  bounded; 
and 

(iv)  f  and  V  are  continuous  on  W . 

Then  all  evolutions  with  initial  conditions  in  W  approach  a  set  of  the  form 
F~1(c)nS',  where  c  is  a  real  constant  and  S  is  the  largest  positively  invariant 
set  contained  in  {p  G  W  \  V (f(p))  =  V(p)}. 

We  refer  to  Section  1.8.1  for  a  discussion  about  the  proof  of  this  result. 
Next,  we  present  the  continuous-time  version  of  the  invariance  principle.  In 
other  words,  we  now  assume  that  ( X ,  /)  is  a  continuous-time  time-invariant 
continuous-space  dynamical  system. 

We  begin  by  revisiting  the  notion  of  non-increasing  function.  Given  a 
continuously  differentiable  function  V  :  X  — >  R,  the  Lie  derivative  of  V 
along  /,  denoted  by  CfV  :  X  — ►  R,  is  defined  by 

C,V(X)  = 

where  the  trajectory  7  :  ]  —  e,  e[  — >  X  satisfies  7 (i)  =  /(7(f))  and  7(0)  =  x. 
If  X  =  M.d ,  then  we  can  write  x  in  components  (xi, . . . ,  Xd)  and  we  can  give 
the  following  explicit  formula  for  the  Lie  derivative: 

d  r)V 

£fv(x)  = 

i= 1 

Similar  formulas  can  be  obtained  for  more  general  state  spaces.  Next,  given 
a  set  W  C  X,  a  function  V  :  X  — >  R  is  non-increasing  along  f  in  W  if 
jCfV(x)  <  0  for  all  x  €  W. 

Finally,  we  state  the  invariance  principle  for  continuous-time  systems. 

Theorem  1.17  (LaSalle  Invariance  Principle  for  continuous-time 
dynamical  systems).  Let  (X,/)  be  a  (continuous-time  continuous-space) 
time-independent  dynamical  system.  Assume  that 

(i)  there  exists  a  set  W  C  X  that  is  positively  invariant  for  (X,  /); 
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(ii)  there  exists  a  function  V  :  X  — ►  R  that  is  non-increasing  along  f 
on  W; 

(in)  all  evolutions  of(X,f)  with  initial  conditions  in  W  are  bounded; 
and 

(iv)  f  and  V  are  continuously  differentiable 3  on  W. 

Then  all  evolutions  with  initial  conditions  in  W  approach  a  set  of  the  form 
R-1(c)nS';  where  c  is  a  real  constant  and  S  is  the  largest  positively  invariant 
set  contained  in  {p  £  W  \  CfV(p)  =  0}. 


1.3.4  Notions  and  results  for  set-valued  systems 

Next,  we  focus  on  a  more  sophisticated  version  of  the  LaSalle  Invariance 
Principle  for  more  general  dynamical  systems,  that  is,  dynamical  systems 
described  by  set-valued  maps  that  allow  for  non-deterministic  evolutions.  To 
do  so,  we  need  to  present  numerous  notions  including  set-valued  dynamical 
systems,  closedness  properties,  and  weak  positive  invariance. 

Specifically,  a  discrete-time  continuous- space  set-valued  dynamical  system 
(in  short,  set-valued  dynamical  system )  is  determined  by  a  tuple  (X,  Xq,T), 
where  X  is  a  d-dimensional  space  chosen  among  M  ,  E>d,  and  the  Cartesian 
products  Mrfl  x  §d2 ,  for  some  d\  +  di  =  d,  Xq  C  X  and  T  :  X  =4  X  is  a 
set-valued  map.  We  assume  that  T  assigns  to  each  point  x  £  X  a  nonempty 
set  T(x)  C  X.  The  individual  objects  X ,  Xq  and  T  are  termed  state  space , 
allowable  initial  states  and  evolution  map,  respectively.  An  evolution  of  the 
dynamical  system  (X,Xq ,  T)  is  any  trajectory  x  :  Z>o  — >  X  satisfying 

x(£  +  l  )£T(i(1)),  feZ>0. 

Figure  1.3  illustrates  this  notion.  In  particular,  a  (time-invariant)  discrete¬ 
time  continuous-space  dynamical  system  ( X ,  Ao,  /)  can  be  seen  as  a  discrete¬ 
time  continuous-space  set-valued  dynamical  system  (X,  Xq,T),  where  the 
evolution  set-valued  map  is  just  the  singleton- valued  map  x  t —>  T(x)  = 
{/(*)}■ 

Next,  we  introduce  a  notion  of  continuity  for  set-valued  maps.  The  evolu¬ 
tion  map  T  is  said  to  be  closed  at  x  E  X  if,  for  any  sequences  {xk  \  k  £  Z> o} 
and  {yk  \  k  £  Z>o }  such  that 

lim  xk  =  x,  lim  yk  =  y,  and  yk^T{xk), 

&— ►+ oo  k—>+ oo 

it  holds  that  y  £  T(x).  The  evolution  set-valued  map  T  is  closed  atWcX 


3It  suffices  that  /  be  locally  Lipschitz  and  V  be  continuously  differentiable;  see  [Cortes,  2008a]. 
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Figure  1.3  Discrete-time  continuous-space  set- valued  dynamical  system.  A  sample  evolu¬ 
tion  is  shown  dashed. 


if  for  any  x  £  W,  T  is  closed  at  x.  Note  that  a  continuous  map  /  :  X  — >  X 
is  closed  when  viewed  as  a  singleton- valued  map. 

(i)  A  set  C  C  X  is  weakly  positively  invariant  with  respect  to  T  if,  for 
any  x  £  C,  there  exists  y  £  C  such  that  y  £  T(x). 

(ii)  A  set  C  C  X  is  strongly  positively  invariant  with  respect  to  T  if 
T{x)  C  C  for  any  x  £  C. 

A  point  xq  is  said  to  be  a  fixed  point  of  T  if  xo  £  T(x o).  A  continuous 
function  V  :  X  — >  M  is  non-increasing  along  T  in  W  C  A  if  V(y)  <  V(x) 
for  all  x  £  W  and  y  £  T(x). 

We  finally  state  and  prove  a  general  version  of  the  invariance  principle, 
whose  proof  is  presented  in  Section  1.8.1. 

Theorem  1.18  (LaSalle  Invariance  Principle  for  set-valued  dis¬ 
crete-time  dynamical  systems).  Let  ( X ,  Xq,  T )  be  a  (discrete- time  continuous- 
space)  set-valued  dynamical  system.  Assume  that 

(i)  there  exists  a  set  W  C  X  that  is  strongly  positively  invariant  for 

(X,X0,T); 

(ii)  there  exists  a  function  V  :  X  — >  R  that  is  non-increasing  along  T 
on  W ; 

(Hi)  all  evolutions  of(X,  Xq,  T)  with  initial  conditions  in  W  are  bounded; 
and 

(iv)  T  is  nonempty  and  closed  at  W  and  V  is  continuous  on  W. 

Then  all  evolutions  with  initial  conditions  in  W  approach  a  set  of  the  form 
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V  1(c)  n  S,  where  c  is  a  real  constant  and  S  is  the  largest  weakly  positively 
invariant  set  contained  in  {p  E  W  \  3p'  E  T(p)  such  that  V(p')  =  V(p)}. 


1.3.5  Notions  and  results  for  time-dependent  systems 

In  this  final  subsection  we  consider  time-dependent  discrete-time  dynamical 
systems  and  discuss  uniform  stability  and  convergence  notions.  We  begin 
with  some  uniform  boundedness,  stability  and  attractivity  definitions. 

In  what  follows,  given  a  time-dependent  discrete-time  dynamical  system 
(X,  Xo,/),  an  evolution  with  initial  condition  in  W  at  time  £q  E  Z>o  is  a 
trajectory  x  :  [-fo,+oo[  — ’ ►  X  of  the  dynamical  system  (X,  Xo , /)  defined  by 
the  initial  condition  x(£o)  =  xo,  for  some  xq  E  W.  In  other  words,  for  time- 
dependent  systems  we  will  often  consider  trajectories  that  begin  at  time  £q 
not  necessarily  equal  to  0. 

Definition  1.19  (Uniformly  bounded  evolutions).  A  time-dependent 
discrete-time  dynamical  system  ( X ,  Xo,  /)  has  uniformly  bounded  evolutions 
if,  given  any  bounded  set  Y,  there  exists  a  bounded  set  W  such  that  every 
evolution  with  initial  condition  in  Y  at  any  time  £q  E  Z>o,  remains  in  W 
for  all  subsequent  times  £  >  £q.  • 

Definition  1.20  (Uniform  stability  and  attractivity  notions).  For  a 

time-dependent  discrete-time  dynamical  system  (X, Xo,/),  the  set  S  is 

(i)  uniformly  stable  for  (X,  Xo,/)  if,  for  any  neighborhood  Y  of  S, 
there  exists  a  neighborhood  W  of  S  such  that  every  evolution  with 
initial  condition  in  W  at  any  time  £q  E  Z>o,  remains  in  Y  for  all 
subsequent  times  £>  £q\ 

(ii)  uniformly  locally  attractive  for  (X,  Xo,  /)  if  there  exists  a  neighbor¬ 
hood  Y  of  S  such  that  every  evolution  with  initial  condition  in  Y 
at  any  time  £q,  approaches  the  set  S  in  the  following  time-uniform 
manner: 

for  all  £q  E  Z>o,  for  all  xq  E  Y,  and  for  all  neighborhoods 
W  of  S,  there  exists  a  single  to  E  Z>o  such  that  the 
evolution  x  :  [f?o,+oo[  — >  X  defined  by  x{£q)  =  xo,  takes 
value  in  W  for  all  times  £  >  £q  +  to;  and 

(iii)  uniformly  locally  asymptotically  stable  if  it  is  uniformly  stable  and 

uniformly  locally  attractive.  • 

With  the  same  notation  in  the  definition,  the  set  S  is  (non-uniformly) 
locally  attractive  if  for  all  £q  E  Z>o,  xq  E  Y,  and  neighborhoods  W  of  S,  the 
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evolution  x  :  [£q,  +oo[  — >  X  defined  by  x(£o)  =  xo,  takes  value  in  W  for  all 
times  £  >  £o  +  tq(£o),  for  some  tq{£$)  G  Z>o- 

To  establish  uniform  stability  and  attractivity  results  we  will  overapprox¬ 
imate  the  evolution  of  the  time-dependent  dynamical  system  by  considering 
the  larger  set  of  evolutions  of  an  appropriate  set-valued  dynamical  system. 
Given  a  time-dependent  evolution  map  /  :  Z>o xX  X,  define  a  set- valued 
overapproximation  map  Tf  :  X  X  by 

Tf(x)  =  {f(£,x)  |  £  G  Z>0}. 

With  this  notion  we  can  state  a  useful  result,  whose  proof  is  left  to  the 
reader  as  an  exercise. 

Lemma  1.21  (Overapproximation  Lemma).  Consider  a  time- dependent 
discrete-time  dynamical  system  (X,Xq  ,/). 

(i)  If  x  :  \£$,  Too  [  — >  X  is  an  evolution  of  the  dynamical  system  ( X ,  f), 
then  y  :  Z>o  — ►  X  defined  by  y{£)  =  x{£  +  £q)  is  an  evolution  of  the 
set-valued  overapproximation  system  ( X,Tj ). 

(ii)  If  the  set  S  is  locally  attractive  for  the  set-valued  overapproximation 
system  (X,Tj),  then  it  is  uniformly  locally  attractive  for  {X,  /). 

In  other  words,  every  evolution  of  the  time-dependent  dynamical  system 
from  any  initial  time  is  an  evolution  of  the  set-valued  overapproximation  sys¬ 
tem,  and  therefore,  the  set  of  trajectory  of  the  set-valued  overapproximation 
system  contains  the  set  of  trajectory  of  the  original  time-dependent  system. 
Uniform  attractivity  is  a  consequence  of  attractivity  for  the  time-invariant 
set-valued  overapproximation. 

1.4  GRAPH  THEORY 

Here  we  present  basic  definitions  about  graph  theory,  following  the  treat¬ 
ments  in  the  literature;  e.g.,  see  [Diestel,  2005,  Godsil  and  Royle,  2001, 
Biggs,  1994] 

A  directed  graph ,  in  short  digraph ,  of  order  n  is  a  pair  G  =  (V,  E),  where 
V  is  a  set  with  n  elements  called  vertices  (or  nodes)  and  if  is  a  set  of  ordered 
pair  of  vertices  called  edges.  In  other  words,  E  C  V  x  V.  We  call  V  and 
E  the  vertex  set  and  edge  set,  respectively.  When  convenient,  we  let  V(G) 
and  E{G)  denote  the  vertices  and  edges  of  G,  respectively.  For  u,v  G  V, 
the  ordered  pair  (it,  v)  denotes  an  edge  from  u  to  v. 

An  undirected  graph,  in  short  graph ,  consists  of  a  vertex  set  V  and  of  a 
set  E  of  unordered  pairs  of  vertices.  For  u,  v  G  V  and  a/v,  the  set  {u,  v} 
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denotes  an  unordered  edge.  A  digraph  is  undirected  if  (v,  u)  E  E  anytime 
(u,  v )  E  E.  It  is  possible  and  convenient  to  identify  an  undirected  digraph 
with  the  corresponding  graph;  vice  versa,  the  directed  version  of  a  graph 
(V,  E )  is  the  digraph  ( V E')  with  the  property  that  (u,  v )  £  E'  if  and  only 
if  {u,  v}  £  E.  In  what  follows,  our  convention  is  to  allow  self-loops  in  both 
graphs  and  digraphs. 

A  digraph  (V\  E')  is  a  subgraph  of  a  digraph  (V,  E)  if  V'  C  V  and  E'  C  E; 
additionally,  a  digraph  (V7,  E')  is  a  spanning  subgraph  if  it  is  a  subgraph  and 
V'  =  V.  The  subgraph  of  (V,E)  induced  byV'cV  is  the  digraph  (V',E'), 
where  E'  contains  all  edges  in  E  between  two  vertices  in  V' .  For  two  digraphs 
G  =  (V,E)  and  G'  =  (V',E'),  the  intersection  and  union  of  G  and  G’  are 
defined  by 


GnG'  =  (vnv',EnE'), 

GUG'  =  {VUV^EUE1). 


Analogous  definitions  may  be  given  for  graphs. 

In  a  digraph  G  with  an  edge  (u,  v)  £  E,  u  is  called  an  in-neighbor  of  v, 
and  v  is  called  an  out-neighbor  of  u.  We  let  ATq(v),  respectively  A/"^ut(n), 
denote  the  set  of  in-neighbors,  respectively  the  set  of  out-neighbors,  of  v 
in  the  digraph  G.  We  will  drop  the  subscript  when  the  graph  G  is  clear 
from  the  context.  The  in- degree  and  out- degree  of  v  are  the  cardinality 
of  A fm(v)  and  Afout(v ),  respectively.  A  digraph  is  topologically  balanced  if 
each  vertex  has  the  same  in-  and  out-degrees  (even  if  distinct  vertices  have 
distinct  degrees).  Likewise,  in  an  undirected  graph  G,  the  vertices  u  and  v 
are  neighbors  if  {u,  v}  is  an  undirected  edge.  We  let  Afc(v)  denote  the  set 
of  neighbors  of  v  in  the  undirected  graph  G.  As  in  the  directed  case,  we  will 
drop  the  subscript  when  the  graph  G  is  clear  from  the  context.  The  degree 
of  v  is  the  cardinality  of  A f(v). 

Remark  1.22  (Additional  notions).  For  a  digraph  G  =  (V,E),  the  re¬ 
verse  digraph  rev(G)  has  vertex  set  V  and  edge  set  r ev(E)  composed  of  all 
edges  in  E  with  reversed  direction.  A  digraph  G  =  (V,  E)  is  complete  if 
E  =  V  x  V.  A  clique  (V',E')  of  a  digraph  (V,  E)  is  a  subgraph  of  (V,  E) 
which  is  complete,  i.e. ,  such  that  E'  =  V'  x  V' .  Note  that  a  clique  is  fully 
determined  by  its  set  of  vertices,  and  hence,  there  is  no  loss  of  precision  in 
denoting  it  by  V' .  A  maximal  clique  V'  of  an  edge  of  a  digraph  is  a  clique 
of  the  digraph  with  the  following  two  properties:  it  contains  the  edge,  and 
any  other  subgraph  of  the  digraph  that  strictly  contains  (V1,  V'  x  V')  is  not 
a  clique.  • 
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1.4.1  Connectivity  notions 

Let  us  now  review  some  basic  connectivity  notions  for  digraphs  and  graphs. 
We  begin  with  the  setting  of  undirected  graphs  because  of  its  simplicity. 

A  path  in  a  graph  is  an  ordered  sequence  of  vertices  such  that  any  two 
consecutive  vertices  in  the  sequence  are  an  edge  of  the  graph.  A  graph  is 
connected  if  there  exists  a  path  between  any  two  vertices.  If  a  graph  is 
not  connected,  then  it  is  composed  of  multiple  connected  components ,  i.e. , 
multiple  connected  subgraphs.  A  path  is  simple  if  no  vertices  appear  more 
than  once  in  it,  except  possibly  for  initial  and  final  vertex.  A  cycle  is  a 
simple  path  that  starts  and  ends  at  the  same  vertex.  A  graph  is  acyclic  if  it 
contains  no  cycles.  A  connected  acyclic  graph  is  a  tree.  A  forest  is  a  graph 
that  can  be  written  as  the  disjoint  union  of  trees.  Trees  have  interesting 
properties:  for  example,  G  =  (V,  E )  is  a  tree  if  and  only  if  G  is  connected 
and  | E\  =  \V\  —  1.  Alternatively,  G  =  (V,  E)  is  a  tree  if  and  only  if  G  is 
acyclic  and  \E\  =  \V\  —  1.  Figure  1.4  illustrates  these  notions. 


Figure  1.4  Illustration  of  connectivity  notions  on  a  graph.  The  graph  has  two  connected 
components.  The  leftmost  connected  component  is  a  tree,  while  the  rightmost 
connected  component  is  a  cycle. 

Next,  we  generalize  these  notions  to  the  case  of  digraphs.  A  directed  path 
in  a  digraph  is  an  ordered  sequence  of  vertices  such  that  any  two  consecutive 
vertices  in  the  sequence  are  a  directed  edge  of  the  digraph.  A  cycle  in  a 
digraph  is  a  directed  path  that  starts  and  ends  at  the  same  vertex  and  that 
contains  no  repeated  vertex  except  for  initial  and  final  vertex.  A  digraph  is 
acyclic  if  it  contains  no  cycles.  In  an  acyclic  graph,  every  vertex  of  in-degree 
0  is  named  source,  and  every  vertex  of  out-degree  0  is  named  sink.  Every 
acyclic  digraph  has  at  least  one  source  and  at  least  one  sink.  Figure  1.5 
illustrates  these  notions. 

The  set  of  cycles  of  a  directed  graph  is  finite.  A  directed  graph  is  aperiodic 
if  there  exists  no  k  >  1  that  divides  the  length  of  every  cycle  of  the  graph. 
In  other  words,  a  digraph  is  aperiodic  if  the  greatest  common  divisor  of  the 
lengths  of  its  cycles  is  one.  Figure  1.6  shows  an  example  of  a  periodic  and 
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Figure  1.5  Illustration  of  connectivity  notions  on  a  digraph,  (a)  shows  an  acyclic  digraph 
with  one  sink  and  two  sources,  (b)  shows  a  directed  path  which  is  also  a  cycle. 


an  aperiodic  digraph. 


Figure  1.6  (a)  A  periodic  digraph,  (b)  An  aperiodic  digraph  with  cycles  of  lenght  2  and  3. 


A  vertex  of  a  digraph  is  globally  reachable  if  it  can  be  reached  from  any 
other  vertex  by  traversing  a  directed  path.  A  digraph  is  strongly  connected 
if  every  vertex  is  globally  reachable.  The  decomposition  of  a  digraph  into 
its  strongly  connected  components  and  the  notion  of  condensation  digraph 
are  discussed  in  Exercise  El.  12. 

A  directed  tree  (sometimes  called  a  rooted  tree)  is  an  acyclic  digraph  with 
the  following  property:  there  exists  a  vertex,  called  the  root ,  such  that  any 
other  vertex  of  the  digraph  can  be  reached  by  one  and  only  one  directed 
path  starting  at  the  root.  In  a  directed  tree,  every  in- neighbor  of  a  vertex  is 
called  a  parent  and  every  out-neighbor  is  called  a  child.  Two  vertices  with 
the  same  parent  are  called  siblings.  A  successor  of  a  vertex  u  is  any  other 
node  that  can  be  reached  with  a  directed  path  starting  at  u.  A  predecessor  of 
a  vertex  v  is  any  other  node  such  that  a  directed  path  exists  starting  at  it  and 
reaching  v.  A  directed  spanning  tree,  or  simply  a  spanning  tree,  of  a  digraph 
is  a  spanning  subgraph  that  is  a  directed  tree.  Clearly,  a  digraph  contains  a 
spanning  tree  if  and  only  if  the  reverse  digraph  contains  a  globally  reachable 
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vertex.  A  (directed)  chain  is  a  directed  tree  with  exactly  one  source  and  one 
sink.  A  (directed)  ring  digraph  is  the  cycle  obtained  by  adding  to  the  edge 
set  of  a  chain  a  new  edge  from  its  sink  to  its  source.  Figure  1.7  illustrates 
some  of  these  notions. 


Figure  1.7  From  left  to  right,  tree,  directed  tree,  chain,  and  ring  digraphs. 


The  proof  of  the  following  result  is  given  in  Section  1.8.2. 

Lemma  1.23  (Connectivity  in  topologically  balanced  digraphs).  Let 

G  be  a  digraph.  The  following  statements  hold: 

(i)  IfGis  strongly  connected,  then  it  contains  a  globally  reachable  vertex 
and  a  spanning  tree;  and 

(ii)  if  G  is  topologically  balanced  and  contains  either  a  globally  reach¬ 
able  vertex  or  a  spanning  tree,  then  G  is  strongly  connected  and  is 
Eulerian.4 

Given  a  digraph  G  =  (V,E),  an  in-neighbor  of  a  nonempty  set  of  nodes 
U  is  a  node  v  £  V  \  U  for  which  there  exists  an  edge  (v,u)  £  E  for  some 
ueU. 

Lemma  1.24  (Disjoint  subsets  and  spanning  trees).  Given  a  digraph 
G  with  at  least  two  nodes,  the  following  two  properties  are  equivalent: 

(i)  G  has  a  spanning  tree;  and 

(ii)  for  any  pair  of  nonempty  disjoint  subsets  U\,  U2  C  V,  either  XJ\  has 
an  in-neighbor  or  U2  has  an  in-neighbor. 

We  postpone  the  proof  to  Section  1.8.2.  The  result  is  illustrated  in  Fig¬ 
ure  1.8.  We  can  also  state  the  result  in  terms  of  global  reachability:  G  has  a 
globally  reachable  node  if  and  only  if  for  any  pair  of  nonempty  disjoint  sub¬ 
sets  U\,  U2  C  V,  either  U±  has  an  out-neighbor  or  U2  has  an  out-neighbor. 
We  let  the  reader  give  a  proper  definition  of  out-neighbor  of  a  set. 


4 A  graph  is  Eulerian  if  it  has  a  cycle  that  visits  all  the  graph  edges  exactly  once. 
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(a) 


Figure  1.8  Illustration  of  Lemma  1.24.  The  root  of  the  spanning  tree  is  plotted  in  gray.  In 


(a),  the  root  is  outside  the  sets  U\  and  U2-  Because  these  sets  are  non-empty, 
there  exists  a  directed  path  from  the  root  to  a  vertex  in  each  one  of  these  sets. 
Therefore,  both  U\  and  U2  have  in-neighbors.  In  (b),  the  root  is  contained  in 
U\.  Because  U2  is  non-empty,  there  exists  a  directed  path  from  the  root  to 
a  vertex  in  U2,  and,  therefore,  U2  has  in-neighbors.  The  case  when  the  root 
belongs  to  U2  is  treated  analogously. 


1.4.2  Weighted  digraphs 

A  weighted  digraph  is  a  triplet  G  =  (V,  E,  A),  where  the  pair  V  =  {-iq, . . . ,  vn} 
and  E  is  a  digraph,  and  where  the  nonnegative  matrix  A  £  is  a 

weighted  adjacency  matrix  with  the  following  properties:  for  z,  j  £  {1, . . . ,  n}, 
the  entry  cty  >  0  if  ( Vi,Vj )  is  an  edge  of  G,  and  a,3  =  0  otherwise.  In  other 
words,  the  scalars  atj ,  for  all  (vi,  Vj)  £  E,  are  a  set  of  weights  for  the  edges  of 
G.  Note  that  the  edge  set  is  uniquely  determined  by  the  weighted  adjacency 
matrix  and  it  can  be  therefore  omitted.  When  convenient,  we  denote  the 
adjacency  matrix  of  a  weighted  digraph  G  by  A(G).  Figure  1.9  shows  an 
example  of  a  weighted  digraph. 


4 


Figure  1.9  A  weighted  digraph  with  natural  weights. 


A  digraph  G  =  ( V ,  E)  can  be  naturally  thought  of  as  a  weighted  digraph 
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by  defining  the  weighted  adjacency  matrix  A  £  {0,  l}nxn  as 


aij  ~ 


if  (vi,Vj)  £  E, 
otherwise, 


(1.4.1) 


where  V  =  {fi, . . . ,  vn}.  The  adjacency  matrix  of  a  graph  is  the  adjacency 
matrix  of  the  directed  version  of  the  graph.  Reciprocally,  given  a  weighted 
digraph  G  =  (V,  E,  A ),  we  refer  to  the  digraph  (V,  E)  as  the  unweighted  ver¬ 
sion  of  G  and  to  its  associated  adjacency  matrix  as  the  unweighted  adjacency 
matrix.  A  weighted  digraph  is  undirected  if  =  aji  for  all  i,  j  £  {1, . . . ,  n}. 
Clearly,  G  is  undirected  if  and  only  if  A{G)  is  symmetric. 


Numerous  concepts  introduced  for  digraphs  remain  equally  valid  for  the 
case  of  weighted  digraphs,  including  the  connectivity  notions  and  the  defi¬ 
nitions  of  in-  and  out-neighbors. 


Finally,  we  generalize  the  notions  of  in-  and  out-degree  to  weighted  di¬ 
graphs.  In  a  weighted  digraph  G  =  ( V,E,A )  with  V  =  {vi, . . .  ,vn},  the 
weighted  out- degree  and  the  weighted  in-degree  of  vertex  Vi  are  defined  by, 
respectively, 

n  n 

dout{vi)  =  ^  aij,  and  din(vi)  =  ^  a3l. 
j= 1  i=1 

The  weighted  digraph  G  is  weight-balanced  if  dout(vi)  =  d\n{vi)  for  all  Vi  £  V. 
The  weighted  out-degree  matrix  DOVLt(G)  and  the  weighted  in-degree  matrix 
Dm(G )  are  the  diagonal  matrices  defined  by 

Dout(G)  =  diag(Aln),  and  D-m(G)  =  diag(ATl„). 

That  is,  (Dont(G))a  =  dOVLt(vi)  and  {Din(G))a  =  din(vi),  respectively. 


1.4.3  Distances  on  digraphs  and  weighted  digraphs 

We  first  present  a  few  definitions  for  unweighted  digraphs.  Given  a  digraph 
G,  the  (topological)  length  of  a  directed  path  is  the  number  of  the  edges 
composing  it.  Given  two  vertices  u  and  v  in  the  digraph  G,  the  distance 
from  u  to  v,  denoted  distort,  v),  is  the  smallest  length  of  any  directed  path 
from  u  to  v,  or  +oo  if  there  is  no  directed  path  from  u  to  v,  that  is, 

distort,  v)  =  min  ({length(p)  |  p  is  a  directed  path  from  u  to  v}  U{+oo}) . 

Given  a  vertex  v  of  a  digraph  G,  the  radius  of  v  in  G  is  the  maximum  of  all 
the  distances  from  v  to  any  other  vertex  in  G,  that  is, 

radius(u,G)  =  max{distG(u,  u)  \  u  £  R(G)}. 
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If  T  is  a  directed  tree  and  v  is  its  root,  then  the  depth  of  T  is  radius(u,  T). 
Finally,  the  diameter  of  the  digraph  G  is 

diam(G)  =  max{distc;(fx,  v)  \  u,  v  S  17(G)}. 

These  definitions  lead  to  the  following  simple  results: 

(i)  radius(u,  G)  <  diam(G)  for  all  vertices  v  of  G; 

(ii)  G  contains  a  spanning  tree  rooted  at  v  if  and  only  if  radius(u,  G)  < 
+oo;  and 

(iii)  G  is  strongly  connected  if  and  only  if  diam(G)  <  +oo. 

The  definitions  of  path  length,  distance  between  vertices,  radius  of  a  vertex, 
and  diameter  of  a  digraph  can  be  easily  applied  to  undirected  graphs. 

Next,  we  consider  weighted  digraphs.  Given  two  vertices  u  and  v  in  the 
weighted  digraph  G,  the  weighted  distance  from  u  to  v,  denoted  wdistG'(u,  v), 
is  the  smallest  weight  of  any  directed  path  from  u  to  v,  or  +oo  if  there  is  no 
directed  path  from  u  to  v,  that  is, 

wdistc(u,  v)  =  min  ({weight(p)  |  p  is  a  directed  path  from  u  to  U{+oo}) . 

Here,  the  weight  of  a  subgraph  of  a  weighted  digraph  is  the  sum  of  the 
weights  of  all  the  edges  of  the  subgraph.  Note  that  when  a  digraph  is  thought 
of  as  a  weighted  digraph  (with  the  unweighted  adjacency  matrix  (1.4.1)),  the 
notions  of  weight  and  weighted  distance  correspond  to  the  usual  notions  of 
length  and  distance,  respectively.  We  leave  it  the  reader  to  provide  the 
definitions  of  weighted  radius,  weighted  depth,  and  weighted  diameter. 


1.4.4  Graph  algorithms 

In  this  section  we  present  a  few  algorithms  defined  on  graphs.  We  only 
present  high-level  descriptions  and  we  refer  to  [Cormen  et  ah,  2001]  for  a 
comprehensive  discussion. 


1.4.4. 1  Breadth-first  spanning  tree 


Let  v  be  a  vertex  of  a  digraph  G  with  radius(u,G)  <  +oo.  A  breadth-first 
spanning  (BFS)  tree  of  G  with  respect  to  v,  denoted  Tbfs,  is  a  spanning 
directed  tree  rooted  at  v  that  contains  a  shortest  path  from  v  to  every  other 
vertex  of  G.  (Here,  a  shortest  path  is  one  with  shortest  topological  length.) 
Let  us  provide  the  BFS  algorithm  that,  given  a  digraph  G  of  order  n  and 
a  vertex  v  with  radius(u,  G)  <  +oo,  computes  a  BFS  tree  Tbfs  rooted  at  v. 
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[Informal  description]  Initialize  a  subgraph  to  contain  only  the 
root  v.  Repeat  radius(u,  G )  times  the  following  instructions: 
attach  to  the  subgraph  all  out-neighbors  of  the  subgraph  as  well 
as  a  single  connecting  edge  for  each  out-neighbor.  The  final 
subgraph  is  the  desired  directed  tree. 


The  algorithm  is  formally  stated  as  follows. 

function  BFS(G,  v) 

1:  (Ri,£i)  :=(W,0) 

2:  for  k  =  2  to  radius (u,  G)  do 

3:  find  all  vertices  wm  not  in  I4-i  that  are  out-neighbors  of 

some  vertex  in  Vk-i  and,  for  j  E  {1, . . .  ,m},  let  e3  be  an  edge 
connecting  a  vertex  in  I4_i  to  Wj 
4:  Vfc  . —  Vfc— 1  U{u>l,  .  .  .  ,  Wm } 

5:  Ek  • —  Ek~ i  U{ei, . . . ,  em} 

6:  return  (Vn,En) 

Note  that  the  output  of  this  algorithm  is  not  necessarily  unique,  since  the 
choice  of  edges  at  step  3:  in  the  algorithm  is  not  unique.  Figure  1.10  shows 
an  execution  of  the  BFS  algorithm. 


Figure  1.10  Execution  of  the  BFS  ALGORITHM.  In  the  leftmost  frame,  vertex  v  is  colored 
in  red.  The  other  frames  correspond  to  incremental  additions  of  vertices  and 
edges  as  specified  by  the  function  BFS.  The  output  of  the  algorithm  is  a 
BFS  tree  of  the  digraph.  The  BFS  tree  is  represented  in  the  last  frame  with 
vertices  and  edges  colored  in  red. 


Some  properties  of  the  BFS  algorithm  are  characterized  as  follows. 

Lemma  1.25  (BFS  tree).  For  a  digraph  G  with  a  vertex  v,  any  digraph 
T  computed  by  the  BFS  algorithm,  T  E  BFS(G,  v),  has  the  following 
properties: 

(i)  T  is  a  directed  tree  with  root  v; 

(ii)  T  contains  a  shortest  path  from  v  to  any  other  vertex  reachable  from 
v  inside  G,  that  is,  if  there  is  a  path  in  G  from  v  to  w,  then  w  E  T 
and  distc(u,u;)  =  dist t(v,w);  and 

(in)  if  G  contains  a  spanning  tree  rooted  at  v,  then  T  is  spanning  too 
and  therefore,  T  is  a  BFS  tree  of  G. 
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We  leave  the  proof  to  the  reader.  The  key  property  of  the  algorithm  is  that 
(Vk,Ek),  k  £  {1, . . .  ,n},  is  a  sequence  of  directed  trees  with  the  property 
that  (Vfc,  Ek)  c  (Vk+i,Ek+l),  for  k  £  {1, ...  ,n  —  1}. 


1.4. 4. 2  Depth- first  spanning  tree 

Next,  we  define  the  DFS  algorithm  that,  given  a  digraph  G  and  a  vertex  v 
with  radius(u,  G)  <  +oo,  computes  what  we  term  a  depth-first  spanning 
(DFS)  tree  Tdfs  rooted  at  v. 

[Informal  description]  Visit  all  nodes  of  the  graph  recording  the 
traveled  edges  to  form  the  desired  tree.  Visit  the  nodes  in  the 
following  recursive  way:  (1)  as  long  as  a  node  has  an  unvisited 
child,  visit  it,  (2)  when  the  node  has  no  more  unvisited  children, 
then  return  to  its  parent  (and  recursively  attempt  to  visit  its 
unvisited  children) . 


The  algorithm  is  formally  stated  as  a  recursive  procedure  as  follows, 
function  DFS(G,  v) 

(^visited)  -^visited)  •  (  { V } ,  0 ) 

2:  DFS- Visit (G,v) 

3:  return  (Kisited,  ^visited) 

function  DFS-Visit(G,  w) 
l:  for  u  out- neighbor  of  w  do 
2:  if  u  does  not  belong  to  Kdsited  then 

3:  ^visited  :=  ^visited  U{u} 

4:  Fvisit.ed  . —  Fvisit.e,]  U  {  i'll' .  'll)  } 

5:  DFS-VlSIT(G,  u) 

Note  that  the  output  of  this  algorithm  is  not  necessarily  unique,  since  the 
order  in  which  the  vertices  are  chosen  in  step  l :  of  DFS-Visit  is  not  unique. 
Any  digraph  T  computed  by  the  DFS  algorithm,  T  £  DFS(G,  v),  is  a 
directed  spanning  tree  with  root  v.  Figure  1.11  shows  an  execution  of  the 
algorithm. 

Some  properties  of  the  DFS  algorithm  are  characterized  as  follows. 

Lemma  1.26  (DFS  tree).  For  a  digraph  G  with  a  vertex  v,  any  digraph 
T  computed  by  the  DFS  algorithm,  T  £  DFS(G,  v),  has  the  following 
properties: 
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Figure  1.11  Execution  of  the  DFS  ALGORITHM.  In  the  top  leftmost  frame,  vertex  v  is 
colored  in  red.  The  other  frames  correspond  to  incremental  additions  of 
vertices  and  edges  as  specified  by  the  function  DFS.  The  output  of  the 
algorithm  is  a  DFS  tree  of  the  digraph.  The  DFS  tree  is  represented  in  the 
last  frame  with  vertices  and  edges  colored  in  red. 


(i)  T  is  a  directed  tree  with  root  v;  and 

(ii)  if  G  contains  a  spanning  tree  rooted  at  v,  then  T  is  spanning  too. 

Note  that  both  BFS  and  DFS  trees  are  uniquely  defined  once  a  lexico¬ 
graphic  order  is  introduced  for  the  children  of  a  node. 


1.4. 4. 3  Shortest-paths  tree  in  weighted  digraphs  via  Dijkstra's  algorithm 

Finally,  we  focus  on  weighted  digraphs  and  on  the  notion  of  weighted  path 
length.  Given  a  weighted  digraph  G  of  order  n  with  weighted  adjacency 
matrix  A  and  a  vertex  v  with  radius  (v,  G )  <  +oo,  a  shortest-paths  tree  of  G 
with  respect  to  v,  denoted  Tshortest-paths;  is  a  spanning  directed  tree  rooted 
at  v  that  contains  a  (weighted)  shortest  path  from  v  to  every  other  vertex 
of  G.  This  tree  differs  from  the  BFS  tree  defined  above  because  here  the 
path  length  is  measured  using  the  digraph  weights. 

We  now  provide  the  Dijkstra  algorithm  that,  given  a  digraph  G  of 
order  n  and  a  vertex  v  with  radius  (v,  G)  <  +oo,  computes  a  shortest-paths 
tree  Tshortest-paths  rooted  at  v . 


[Informal  description]  Incrementally  construct  a  tree  that  con¬ 
tains  only  shortest  paths.  At  each  round,  add  to  the  tree  (1)  the 
node  that  is  closest  to  the  source  and  is  not  yet  in  the  tree,  and 
(2)  the  edge  corresponding  to  the  shortest  path.  The  weighted 
distance  to  the  source  (required  to  perform  step  (1))  is  computed 
via  an  array  of  distance  estimates  that  is  updated  as  follows: 
when  a  node  is  added  to  the  tree,  the  distance  estimates  of  all 
its  out-neighbors  are  updated. 
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The  algorithm  is  formally  stated  as  follows. 


function  Dijkstra((V,  E,  A),  u) 


1 :  Tshortest-paths  • —  @ 

%  Initialize  estimated  distances  and  estimated  parent  nodes 

2:  for  u  G  V  do 


3:  dist(u) 


0,  u  =  v, 
Too,  otherwise. 


4:  parent  (it)  :=  u 

%  Main  loop  to  grow  the  tree  and  update  estimates 
5:  while  (Tshortest-paths  does  not  contain  all  vertices)  do 
6:  find  vertex  u  outside  Tshortest-paths  with  smallest  dist('u) 

7:  add  to  Tshortest_paths  the  vertex  u 

8:  if  u  /  v,  add  to  Tshortest_paths  the  edge  (parent  (u),  u) 

9:  for  each  node  w  that  is  an  out-neighbor  of  u  in  (V,  E,  A)  do 

10:  if  dist(u;)  >  dist(u)  T  auw  then 

li:  dist(u;)  :=  dist(u)  T  auw 

12:  parent(tc)  :=  u 

13:  return  Tshortest-paths 


Note  that  the  output  of  this  algorithm  is  not  necessarily  unique,  since  the 
choice  of  vertex  at  step  6:  in  the  algorithm  is  not  unique.  Figure  1.12  shows 
an  execution  of  the  Dijkstra’s  algorithm. 


Figure  1.12  Execution  of  the  Dijkstra’s  algorithm  on  the  weighted  digraph  plotted  in 
Figure  1.9.  In  the  top  leftmost  frame,  vertex  v  is  colored  in  gray.  The  other 
frames  correspond  to  incremental  additions  of  vertices  and  edges  as  specified 
by  the  function  Dijkstra.  The  output  of  the  algorithm  is  a  shortest-paths 
tree  of  the  digraph  rooted  at  v.  This  tree  is  represented  in  the  last  frame  with 
vertices  and  edges  colored  in  gray. 


The  following  properties  of  the  Dijkstra’s  algorithm  mirrors  those  of  the 
BFS  algorithm  in  Lemma  1.25. 
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Lemma  1.27  (Dijkstra’s  algorithm).  For  a  weighted  digraph  G  with 
a  vertex  v,  any  digraph  T  computed  by  the  Dijkstra  algorithm,  T  £ 
Dijkstra(G',  v),  has  the  following  properties: 

(i)  T  is  a  directed  tree  with  root  v; 

(ii)  T  contains  a  shortest  path  from  v  to  any  other  vertex  reachable  from 
v  inside  G,  that  is,  if  there  is  a  path  in  G  from  v  to  w,  then  w  £  T 
and  wdistG(u,  w)  =  wdist t(v,w);  and 

(Hi)  if  G  contains  a  spanning  tree  rooted  at  v,  then  T  is  spanning  too, 
and  therefore,  T  is  a  shortest-paths  tree  of  G. 


1.4. 4. 4  On  combinatorial  optimization  problems 

We  conclude  this  section  on  graph  algorithms  with  a  brief  mention  of  classic 
optimization  problems  defined  on  graphs.  Standard  references  on  combina¬ 
torial  optimization  include  [Korte  and  Vygen,  2005,  Vazirani,  2001].  Given 
a  weighted  directed  graph  G,  classical  combinatorial  optimization  problems 
include  the  following: 


Minimum-weight  spanning  tree.  A  minimum-weight  spanning  tree  (MST) 
of  G,  denoted  TmsTj  is  a  spanning  tree  with  the  minimum  possible 
weight.  In  order  for  the  MST  to  exist,  G  must  contain  a  spanning 
tree.  If  all  the  weights  of  the  individual  edges  are  different,  then  the 
MST  is  unique; 

Traveling  salesperson  problem.  A  traveling  salesperson  tour  (TSP)  of 
G,  denoted  Ttsp,  is  a  cycle  that  passes  through  all  the  nodes  of  the 
digraph  and  has  the  minimum  possible  weight.  In  order  for  the  TSP 
to  exist,  G  must  contain  a  cycle  through  all  nodes;  and 

Multicenter  optimization  problems.  Given  a  weighted  digraph  with  ver¬ 
tices  V  =  {vi, . . . ,  vn}  and  a  set  U  =  {u\, . . . ,  uk}  C  V,  the  weighted 
distance  from  v  £  V  to  the  set  U  is  the  smallest  weighted  distance  from 
v  to  any  vertex  in  {u\, . . . ,  uk} ■  We  now  consider  the  cost  functions 
Hmav,  hiv.  :  Vk  — >  R  defined  by 

'Hmax('Wi)  ...,uk)=  max  min  wdistc^i,  uh), 

n 

ftsCui,  ...,uk)  =  }  min  wdistG(r>;,  uh). 

“  he{l,...,k} 

1=1 

The  k- center  problem  and  the  k-median  problem  consist  of  finding  a  set 
of  vertices  {u\, . . . ,  uk}  that  minimizes  the  k-center  function  7imax  and 
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the  k-median  function  Tts,  respectively.  We  refer  to  [Vazirani,  2001] 
for  a  discussion  of  the  fc-center  and  fc-median  problems  (as  well  as 
the  more  general  uncapacited  facility  location  problem)  over  complete 
undirected  graphs  with  edge  costs  satisfying  the  triangle  inequality. 


The  Euclidean  versions  of  these  combinatorial  optimization  problems  refer 
to  the  situation  where  one  considers  a  weighted  complete  digraph  whose 
vertex  set  is  a  point  set  in  Mrf,  d  G  N,  and  whose  weight  map  assigns  to  each 
edge  the  Euclidean  distance  between  the  two  nodes  connected  by  the  edge. 


1.4.5  Algebraic  graph  theory 


Algebraic  graph  theory  [Godsil  and  Royle,  2001,  Biggs,  1994]  is  the  study 
of  matrices  defined  by  digraphs:  in  this  section  we  expose  two  topics.  First, 
we  study  the  equivalence  between  properties  of  graphs  and  of  their  associ¬ 
ated  adjacency  matrices.  We  also  specify  how  to  associate  a  digraph  to  a 
nonnegative  matrix.  Second,  we  introduce  and  characterize  the  Laplacian 
matrix  of  a  weighted  digraph. 


We  begin  by  studying  adjacency  matrices.  Note  that  the  adjacency  matrix 
of  a  weighted  digraph  is  nonnegative  and,  in  general,  not  stochastic.  The 
following  lemma  expands  on  this  point. 


Lemma  1.28  (Weight-balanced  digraphs  and  doubly  stochastic  ad¬ 
jacency  matrices).  Let  G  be  a  weighted  digraph  of  order  n  with  weighted 
adjacency  matrix  A  and  weighted  out-degree  matrix  Dout.  Define  the  matrix 


F  = 


DoutA 
{In  T  Doutj 


-1 


{In  +  A ), 


if  each  out- degree  is  strictly  positive, 
otherwise. 


Then 


(i)  F  is  row-stochastic;  and 

(ii)  F  is  doubly  stochastic  if  G  is  weight-balanced  and  the  weighted  degree 
is  constant  for  all  vertices. 


Proof.  Consider  first  the  case  when  each  vertex  has  an  outgoing  edge  so  that 
Dout  is  invertible.  We  first  note  that  diag(u)_1v  =  ln,  for  each  v  G  (M\{0})n. 
Therefore 

{D~^tA)ln  =  diag(Aln)_1(Aln)  =  ln, 
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which  proves  (i).  Furthermore,  if  Dout  =  D[n  =  dln  for  some  d  £  M>o,  then 
{Dont  A)Tln  =  2(ATln)  =  Dr1(ATln)  =  diag(^Tln)_1(Rrln)  =  ln, 

which  proves  (ii).  Finally,  if  ( V,E,A )  does  not  have  outgoing  edges  at  each 
vertex,  then  apply  the  statement  to  the  weighted  digraph  (V,  E  U{(i,  *)  |  i  £ 
{!,■■■,«}},  A  +  In).  ■ 


The  next  result  characterizes  the  relationship  between  the  adjacency  ma¬ 
trix  and  directed  paths  in  the  digraph. 

Lemma  1.29  (Directed  paths  and  powers  of  the  adjacency  matrix). 

Let  G  be  a  weighted  digraph  of  order  n  with  weighted  adjacency  matrix  A, 
with  unweighted  adjacency  matrix  Aq  i  £  {0,  l}nxn,  and  possibly  with  self¬ 
loops.  For  all  i,j,kE{l,...,n} 

(if  the  (i,j)  entry  of  A$1  equals  the  number  of  directed  paths  of  length 
k  (including  paths  with  self-loops)  from  node  i  to  node  j;  and 

(ii)  the  ( i,j )  entry  of  Ak  is  positive  if  and  only  if  there  exists  a  directed 
path  of  length  k  (including  paths  with  self-loops)  from  node  i  to 
node  j . 


Proof.  The  second  statement  is  a  direct  consequence  of  the  first.  The  first 
statement  is  proved  by  induction.  The  statement  is  clearly  true  for  k  =  1. 
Next,  we  assume  the  statement  is  true  for  k  >  1  and  we  prove  it  for  k  +  1. 
By  assumption,  the  entry  {Ak)ij  equals  the  number  of  directed  paths  from 
i  to  j  of  length  k.  Note  that  each  path  from  i  to  j  of  length  k  +  1  identifies 
(1)  a  unique  node  £  such  that  (i,£)  is  an  edge  of  G  and  (2)  a  unique  path 
from  l  to  j  of  length  k.  We  write  Ak+l  =  AAk  in  components  as 

n 

(^‘+1)«  =  E  Au(Ak)tj. 

1=  1 

Therefore,  it  is  true  that  the  entry  ( Ak+1)ij  equals  the  number  of  directed 
paths  from  i  to  j  of  length  k  + 1.  This  concludes  the  induction  argument.  ■ 


The  following  proposition  characterizes  in  detail  the  relationship  between 
various  connectivity  properties  of  the  digraph  and  algebraic  properties  of 
the  adjacency  matrix.  The  result  is  illustrated  in  Figure  1.13  and  its  proof 
is  postponed  to  Section  1.8.3. 

Proposition  1.30  (Connectivity  properties  of  the  digraph  and  pos¬ 
itive  powers  of  the  adjacency  matrix).  Let  G  be  a  weighted  digraph 
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Figure  1.13  Illustration  of  Proposition  1.30.  Even  though  vertices  2  and  3  are  globally 
reachable,  the  digraph  is  not  strongly  connected  because  vertex  1  has  no 
in-neighbor  other  than  itself.  Therefore,  the  associated  adjacency  matrix 
A  =  ( a,ij )  with  (aij)  =  13,  ( a2j )  =  (a3 j)  =  (0, 1, 1),  is  reducible. 


of  order  n  with  weighted  adjacency  matrix  A.  The  following  statements  are 
equivalent: 

(i)  G  is  strongly  connected, 

(ii)  A  is  irreducible;  and 
(in)  is  positive. 

For  any  j  £  {1, . . . ,  n},  the  following  two  statements  are  equivalent: 

(iv)  the  jth  node  of  G  is  globally  reachable;  and 

(v)  the  jth  column  ofYZ/kZ o  Ak  has  positive  entries. 

Stronger  statements  can  be  given  for  digraphs  with  self-loops. 

Proposition  1.31  (Connectivity  properties  of  the  digraph  and  posi¬ 
tive  powers  of  the  adjacency  matrix:  continued).  Let  G  be  a  weighted 
digraph  of  order  n  with  weighted  adjacency  matrix  A  and  with  self-loops  at 
each  node.  The  following  statements  are  equivalent: 

(iv)  G  is  strongly  connected;  and 

(v)  An_1  has  positive  entries. 

For  any  j  G  (1, . . . ,  n},  the  following  two  statements  are  equivalent: 

(iv)  the  jth  node  of  G  is  globally  reachable;  and 

(v)  the  jth  column  of  A12^1  has  positive  entries. 

Next,  we  characterize  the  relationship  between  irreducible  aperiodic  di¬ 
graphs  and  primitive  matrices  (recall  Definition  1.10).  We  postpone  the 
proof  to  Section  1.8.3. 
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Proposition  1.32  (Strongly  connected  and  aperiodic  digraph  and 
primitive  adjacency  matrix).  Let  G  be  a  weighted  digraph  of  order  n  with 
weighted  adjacency  matrix  A.  The  following  two  statements  are  equivalent: 

(i)  G  is  strongly  connected  and  aperiodic;  and 

(ii)  A  is  primitive,  i.e.,  there  exists  k  &  N  such  that  Ak  is  positive. 

This  concludes  our  study  of  adjacency  matrices  associated  to  weighted 
digraphs.  Next,  we  emphasize  how  all  results  obtained  so  far  have  analogues 
that  hold  when  the  original  object  is  a  nonnegative  matrix,  instead  of  a 
weighted  digraph. 

Remark  1.33  (From  a  nonnegative  matrix  to  its  associated  di¬ 
graphs).  Given  a  nonnegative  n  x  n  matrix  A,  its  associated  weighted  di¬ 
graph  is  the  weighted  digraph  with  nodes  {1, . . . ,  n},  and  weighted  adjacency 
matrix  A.  The  unweighted  version  of  this  weighted  digraph  is  called  the  as¬ 
sociated  digraph.  The  following  statements  are  analogues  of  the  previous 
lemmas: 

(i)  If  A  is  stochastic,  then  its  associated  digraph  has  weighted  out- 
degree  matrix  equal  to  In ; 

(ii)  if  A  is  doubly  stochastic,  then  its  associated  weighted  digraph  is 
weight-balanced  and,  additionally,  both  in-degree  and  out-degree 
matrices  are  equal  to  Jn;  and 

(iii)  A  is  irreducible  if  and  only  if  its  associated  weighted  digraph  is 

strongly  connected.  • 

So  far  we  have  analyzed  in  detail  the  properties  of  adjacency  matrices.  We 
conclude  this  section  by  studying  a  second  relevant  matrix  associated  to  a 
digraph,  called  the  Laplacian  matrix.  The  Laplacian  matrix  of  the  weighted 
digraph  G  is 


L(G)  =  Dout(G)  -  A(G). 

Some  immediate  consequences  of  this  definition  are  the  following: 

(i)  L(G)  ln  =  0n,  that  is,  0  is  an  eigenvalue  of  L(G)  with  eigenvector 
Ini 

(ii)  G  is  undirected  if  and  only  if  L(G)  is  symmetric;  and 

(iii)  L(G)  equals  the  Laplacian  matrix  of  the  digraph  obtained  by  adding 
to  or  removing  from  G  any  self-loop  with  arbitrary  weight. 

Further  properties  are  established  as  follows. 
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Theorem  1.34  (Properties  of  the  Laplacian  matrix).  Let  G  be  a 

weighted  digraph  of  order  n.  The  following  statements  hold: 

(i)  all  eigenvalues  of  L(G)  have  nonnegative  real  part  (thus,  if  G  is 
undirected,  then  L{G)  is  symmetric  positive  semidefinite) ; 

(ii)  if  G  is  strongly  connected,  then  rank(L(G))  =  n  —  l,  that  is,  0  is  a 
simple  eigenvalue  of  L{G); 

(in)  G  contains  a  globally  reachable  vertex  if  and  only  i/rank(L(G))  = 
n  —  1; 

(iv)  the  following  three  statements  are  equivalent: 

(a)  G  is  weight-balanced; 

(b)  1  nL(G)  =  0£;  and 

(c)  L(G)  +  L(G)t  is  positive  semidefinite. 

1.5  DISTRIBUTED  ALGORITHMS  ON  SYNCHRONOUS 
NETWORKS 

Here  we  introduce  a  synchronous  network  as  a  group  of  processors  with  the 
ability  to  exchange  messages  and  perform  local  computations.  What  we 
present  is  a  basic  classic  model  studied  extensively  in  the  distributed  algo¬ 
rithms  literature.  Our  treatment  is  directly  adopted  with  minor  variations 
from  the  texts  by  Lynch  [1997]  and  Peleg  [200*  ]. 


1.5.1  Physical  components  and  computational  models 

Loosely  speaking,  a  synchronous  network  is  a  group  of  processors,  or  nodes, 
that  possess  a  local  state,  exchange  messages  along  the  edges  of  a  digraph, 
and  compute  an  update  to  their  local  state  based  on  the  received  messages. 
Each  processor  alternates  the  two  tasks  of  exchanging  messages  with  its 
neighboring  processors  and  of  performing  a  computation  step.  Let  us  begin 
by  describing  what  constitutes  a  network. 

Definition  1.35  (Network).  The  physical  component  of  a  synchronous 
network  S  is  a  digraph  (/,  Lcmm),  where 

(i)  /  =  {1, . . . ,  n}  is  called  the  set  of  unique  identifiers  ( UIDs );  and 

(ii)  Emm  is  a  set  of  directed  edges  over  the  vertices  { 1, . . .  ,n},  called 

the  communication  links.  • 

In  general,  the  set  of  unique  identifiers  does  not  need  to  be  n  consecutive 
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natural  numbers,  but  we  take  this  convention  for  simplicity.  The  set  Ecmm 
models  the  topology  of  the  communication  service  among  the  nodes:  for 
i,j  £  {1, . . . ,  n},  processor  i  can  send  a  message  to  processor  j  if  the  directed 
edge  (i,j)  is  present  in  Ecmm.  Note  that,  unlike  the  standard  treatments 
in  [Lynch,  1997,  Peleg,  2000],  we  do  not  assume  the  digraph  to  be  strongly 
connected;  the  required  connectivity  assumption  will  be  specified  on  a  case 
by  case  basis. 

Next,  we  discuss  the  state  and  the  algorithms  that  each  processor  possesses 
and  executes,  respectively.  By  convention,  we  let  the  superscript  [i]  denote 
any  quantity  associated  with  the  node  i. 

Definition  1.36  (Distributed  algorithm).  A  distributed  algorithm  T>A 
for  a  network  S  consists  of  the  sets 

(i)  A,  a  set  containing  the  null  element,  called  the  communication 
alphabet ;  elements  of  A  are  called  messages ; 

(ii)  IpW,  i  £  I,  called  the  processor  state  sets ;  and 

(iii)  Wq^  C  W M,  %  £  /,  sets  of  allowable  initial  values ; 

and  of  the  maps 

(i)  msgW  :  W®  x  I  — >  A,  i  £  I,  called  message-generation  functions', 
and 

(ii)  stfW  :  VpW  x  An  — >  W^\  i  £  /,  called  state-transition  functions. 

If  W'W  =  w,  rrisgW  =  msg,  and  stf^  =  stf  for  all  i  £  I,  then  T>A  is  said  to 
be  uniform  and  is  described  by  a  tuple  (A,  W,  {W^ }ig/,  msg,  stf).  • 

Now,  with  all  elements  in  place,  we  can  explain  in  more  detail  how  a 
synchronous  network  executes  a  distributed  algorithm,  see  Figure  1.14.  The 


Figure  1.14  Execution  of  a  distributed  algorithm  by  a  synchronous  network. 

state  of  processor  i  is  a  variable  w  M  £  W^\  initially  set  equal  to  an  allowable 
value  in  WqK  At  each  time  instant  i  £  Z>o,  processor  i  sends  to  each 
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of  its  out-neighbors  j  in  the  communication  digraph  (/,  Ec mm)  a  message 
(possibly  the  null  message)  computed  by  applying  the  message-generation 
function  msgW  to  the  current  values  of  its  state  and  to  the  identity  j. 
Subsequently,  but  still  at  time  instant  t  G  Z>o,  processor  i  updates  the  value 
of  its  state  by  applying  the  state-transition  function  stf^  to  the  current 
value  of  its  state  w M  and  to  the  messages  it  receives  from  its  in-neighbors. 
Note  that,  at  each  round,  the  first  step  is  transmission  and  the  second  one 
is  computation.  These  notions  are  formalized  in  the  following  definition. 


Definition  1.37  (Network  evolution).  Let  DA  be  a  distributed  algo¬ 
rithm  for  the  network  S.  The  evolution  of  (S.  VA)  from  initial  conditions 
iCq^  G  Wq  \  i  E  /,  is  the  collection  of  trajectories  it;W  :  Z>o  — +  W^\  i  G  /, 
satisfying 

w®(£)  =  st^(w®(£-l),y®(£)), 


where  u;W(— 1)  =  ,  i  G  I,  and  where  the  trajectory  yW  :  Z>o  — >  An 

(describing  the  messages  received  by  processor  i )  has  components  (£).  for 
j  G  I,  given  by 


msg W(u;[Jl(£-  1),  7),  if  (, j,i )  G  Ecmm 

null,  otherwise. 


We  let  i  e->  w(£)  =  («;W  (£),...  ,  w^(£))  denote  the  collection  of  trajectories. 


We  conclude  this  section  with  two  sets  of  remarks.  We  first  discuss  some 
aspects  of  our  communication  model  that  have  a  large  impact  on  the  sub¬ 
sequent  development.  We  then  collect  a  few  general  comments  about  dis¬ 
tributed  algorithms  on  networks. 


Remarks  1.38  (Aspects  of  the  communication  model). 

(i)  The  network  S  and  the  algorithm  DA  are  referred  to  as  synchronous 
because  the  communications  between  all  processors  takes  place  at 
the  same  time  for  all  processors. 

(ii)  Communication  is  modeled  as  a  so-called  “point  to  point”  service:  a 
processor  can  specify  different  messages  for  different  out-neighbors 
and  knows  the  processor  identity  corresponding  to  any  incoming 
message. 

(iii)  Information  is  exchanged  between  processors  as  messages,  i.e.,  ele¬ 
ments  of  the  alphabet  A;  the  message  null  indicates  no  communi¬ 
cation.  Messages  might  encode  logical  expressions  such  as  true  and 
false,  or  finite-resolution  quantized  representations  of  integer  and 
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real  numbers. 

(iv)  In  some  uniform  algorithms,  the  messages  between  processors  are 
the  processors’  states.  In  such  cases,  the  corresponding  commu¬ 
nication  alphabet  is  A  =  W  U{null}  and  the  message  generation 
function  msgstc](ic,  j)  =  w  is  referred  to  as  the  standard  message- 
generation  function.  • 


Remarks  1.39  (Advanced  topics:  Control  structures  and  failures). 

(i)  Processors  in  a  network  have  only  partial  information  about  the 
network  topology.  In  general,  each  processor  only  knows  its  own 
UID,  and  the  UID  of  its  in-  and  out-neighbors.  Sometimes  we  will 
assume  that  the  processor  knows  the  network  diameter.  In  some 
cases  [Peleg,  2000]  actively  running  networks  might  depend  upon 
“control  structures,”  i.e. ,  structures  that  are  computed  at  initial 
time  and  are  exploited  in  subsequent  algorithms.  For  example, 
routing  tables  might  be  computed  for  routing  problems,  “leader” 
processors  might  be  elected  and  tree  structures  might  be  computed 
and  represented  in  a  distributed  manner  for  various  tasks,  e.g.,  color¬ 
ing  or  maximal  independent  set  problems.  We  present  some  sample 
algorithms  to  compute  these  structures  below. 

(ii)  A  key  issue  in  the  study  of  distributed  algorithms  is  the  possible 

occurrence  of  failures.  A  network  might  experience  intermittent 
or  permanent  communication  failures:  along  given  edges  a  null 
message  or  an  arbitrary  message  might  be  delivered  instead  of  the 
intended  value.  Alternatively,  a  network  might  experience  various 
types  of  processor  failures:  a  processor  might  transmit  only  null 
messages  (i.e.,  the  msg  function  returns  null  always),  a  processor 
might  quit  updating  its  state  (i.e.,  the  stf  function  neglects  incoming 
messages  and  returns  the  current  state  value),  or  a  processor  might 
implement  arbitrarily  modified  msg  and  stf  functions.  The  latter 
situation,  in  which  completely  arbitrary  and  possibly  malicious  be¬ 
havior  is  adopted  by  faulty  nodes,  is  referred  to  as  a  Byzantine 
failure  in  the  distributed  algorithms  literature.  • 


1.5.2  Complexity  notions 

Here  we  begin  our  analysis  of  the  performance  of  distributed  algorithms. 
We  introduce  a  notion  of  algorithm  completion  and,  in  turn,  we  introduce 
the  classic  notions  of  time,  space,  and  communication  complexity. 
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Definition  1.40  (Algorithm  completion).  We  say  that  an  algorithm 
terminates  when  only  null  messages  are  transmitted  and  all  processors 
states  become  constants.  • 


Remarks  1.41  (Alternative  termination  notions). 

(i)  In  the  interest  of  simplicity,  we  have  defined  evolutions  to  be  un¬ 
bounded  in  time  and  we  do  not  explicitly  require  algorithms  to 
actually  have  termination  conditions,  i.e. ,  to  be  able  to  detect  when 
termination  takes  place. 

(ii)  It  is  also  possible  to  define  the  termination  time  as  the  first  instant 
when  a  given  problem  or  task  is  achieved,  independently  of  the  fact 
that  the  algorithm  might  continue  to  transmit  data  subsequently.  • 

Definition  1.42  (Time  complexity).  The  (worst- case)  time  complexity 
of  a  distributed  algorithm  VA  on  a  network  S ,  denoted  TC(H4),  is  the 
maximum  number  of  rounds  required  by  the  execution  of  VA  on  S  among 
all  allowable  initial  states  until  termination.  • 

Next,  we  quantify  memory  and  communication  requirements  of  distributed 
algorithms.  From  an  information  theory  viewpoint  [Gallager,  1968],  the  in¬ 
formation  content  of  a  memory  variable  or  of  a  message  is  properly  mea¬ 
sured  in  bits.  On  the  other  hand,  it  is  convenient  to  use  the  alternative  no¬ 
tions  of  “basic  memory  unit”  and  “basic  message.”  It  is  customary  [Peleg, 
2000]  to  assume  that  a  “basic  memory  unit”  or  a  “basic  message”  contains 
log(n)  bits  so  that,  for  example,  the  information  content  of  a  robot  identi¬ 
fier  i  £  {1, . . .  ,n}  is  log(n)  bits  or,  equivalently,  one  “basic  memory  unit.” 
Note  that  elements  of  the  processor  state  set  W  or  of  the  alphabet  set  A 
might  amount  to  multiple  basic  memory  units  or  basic  messages;  the  null 
message  has  zero  cost.  Unless  specified  otherwise,  the  following  definitions 
and  examples  are  stated  in  terms  of  basic  memory  unit  and  basic  messages: 

Definition  1.43  (Space  complexity).  The  (worst-case)  space  complexity 
of  a  distributed  algorithm  VA  on  a  network  S,  denoted  by  80(714),  is  the 
maximum  number  of  basic  memory  units  required  by  a  processor  executing 
VA  on  S  among  all  processors  and  among  all  allowable  initial  states  until 
termination.  • 

Remark  1.44  (Space  complexity  conventions).  By  convention,  each 
processor  knows  its  identity,  i.e.,  it  requires  log(n)  bits  to  represent  its  unique 
identifier  in  a  set  with  n  distinct  elements.  We  do  not  count  this  cost  in  the 
space  complexity  of  an  algorithm.  • 

Next,  we  compute  the  communication  complexity  by  counting  basic  mes- 

40 


“Distributed  Control  of  Robotic  Networks”  by  F.  Bullo,  J.  Cortes  and  S.  Martinez 
Copyright  ©  2006-2008.  Manuscript  under  contract.  This  version:  October  27,  2008 


DCRN  October  27,  2008 


sages. 

Definition  1.45  (Communication  complexity).  The  ( worst-case )  com¬ 
munication  complexity  of  a  distributed  algorithm  VA  on  a  network  S,  de¬ 
noted  by  CC(VA),  is  the  maximum  number  of  basic  messages  transmitted 
over  the  entire  network  during  the  execution  of  VA  among  all  allowable 
initial  states  until  termination.  • 

We  conclude  this  section  by  discussing  ways  of  quantifying  time,  space 
and  communication  complexity.  The  idea,  borrowed  from  combinatorial  op¬ 
timization,  is  to  adopt  asymptotic  “order  of  magnitude”  measures.  Formally, 
complexity  bounds  will  be  expressed  with  respect  to  the  Bachman-Laundau 
symbols  O,  17  and  0  defined  in  Section  1.1.  Let  us  be  more  specific. 

(i)  We  will  say  that  an  algorithm  has  time  complexity  of  order  17(/(n)) 
over  some  network  if,  for  all  n,  there  exists  a  network  of  order  n 
and  initial  processor  values  such  that  the  time  complexity  of  the 
algorithm  is  greater  than  a  constant  factor  times  /(n); 

(ii)  we  will  say  that  an  algorithm  has  time  complexity  of  order  0(/(n)) 
over  arbitrary  networks  if,  for  all  n,  for  all  networks  of  order  n  and 
for  all  initial  processor  values  the  time  complexity  of  the  algorithm 
is  lower  than  a  constant  factor  times  /(n);  and 

(iii)  we  will  say  that  an  algorithm  has  time  complexity  of  order  0(/(n)) 
if  its  time  complexity  is  of  order  17  (/(n))  over  some  network  and 
0(f(n))  over  arbitrary  networks  at  the  same  time. 

Similar  conventions  will  be  used  for  space  and  communication  complexity. 

In  many  cases  the  complexity  of  an  algorithm  will  typically  depend  upon 
the  number  of  vertices  of  the  network.  It  is  therefore  useful  to  present  a 
few  simple  facts  about  these  functions  now.  Over  arbitrary  digraphs  S  = 
(I,  Ecmm)  of  order  n,  we  have 

diam(5)  S  0(n),  | -E7mm(£)|  £  0(n2)  and  radius(u, 5)  6  0(diam(5)), 

where  v  is  any  vertex  of  S. 

Remark  1.46  (Additional  complexity  notions).  Numerous  variations 
of  the  proposed  complexity  notions  are  possible  and  may  be  of  interest. 


Global  lower  bounds.  In  the  definition  of  lower  bound,  consider  the  logic 
quantifier  describing  the  role  of  the  network.  The  lower  bound  state¬ 
ment  is  “existential”  rather  than  “global,”  in  the  sense  that  the  bound 
does  not  hold  for  all  graphs.  As  discussed  in  [Peleg,  2000],  it  is  pos¬ 
sible  to  define  also  “global”  lower  bounds,  i.e.,  lower  bounds  over  all 
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graphs,  or  lower  bounds  over  specified  classes  of  graphs. 

Average  complexity  notions.  The  proposed  complexity  notions  focus  on 
the  worst-case  situation.  It  is  also  possible  to  define  expected  or  aver¬ 
age  complexity  notions,  where  one  is  be  interested  in  characterizing, 
for  example,  the  average  number  of  rounds  required  or  the  average 
number  of  basic  messages  transmitted  over  the  entire  network  during 
the  execution  of  an  algorithm  among  all  allowable  initial  states  until 
termination. 

Problem  complexity.  It  is  possible  to  define  complexity  notions  for  prob¬ 
lems,  rather  than  algorithms,  by  considering,  for  example,  the  worst- 
case  optimal  performance  among  all  algorithms  that  solve  the  given 
problem,  or  over  classes  of  algorithms  or  classes  of  graphs.  • 


1.5.3  Broadcast  and  BFS  tree  computation 

In  the  following,  we  consider  some  basic  algorithmic  problems  such  as  the 
simple  one-to-all  communication  task,  i.e.,  broadcasting,  and  the  establish¬ 
ment  of  some  “control  structures,”  see  Remarks  1.39,  such  as  the  construc¬ 
tion  of  a  BFS  spanning  tree  and  the  election  of  a  leader. 

Problem  1.47  (Broadcast).  Assume  that  a  processor,  called  the  source, 
has  a  message,  called  the  token.  Transmit  the  token  to  all  others  processors. 


Note  that  existence  of  a  spanning  tree  rooted  at  the  source  is  a  neces¬ 
sary  requirement  for  the  broadcast  problem  to  be  solvable.  We  begin  by 
establishing  some  analysis  results  for  the  broadcast  problem. 

Lemma  1.48  (Complexity  lower  bounds  for  the  broadcast  prob¬ 
lem).  Let  S  be  a  network  containing  a  spanning  tree  rooted  at  v.  The 
broadcast  problem  for  S  from  the  source  v  has  communication  complexity 
lower  bounded  by  n  —  1  and  time  complexity  lower  bounded  by  radius(v,<S). 

In  what  follows,  we  shall  solve  the  broadcast  problem  while  simultaneously 
also  considering  the  following  problem. 

Problem  1.49  (BFS  tree  computation).  Let  S  be  a  network  containing 
a  spanning  tree  rooted  at  v.  Compute  a  distributed  representation  for  a 
BFS  tree  rooted  at  v.  • 

We  add  two  remarks  on  the  BFS  tree  computation  problem. 
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(i)  By  a  distributed  representation  of  a  directed  tree  with  bounded 
memory  at  each  node  we  mean  the  following:  each  child  vertex 
knows  the  identity  of  its  parent  and  the  root  vertex  knows  it  has  no 
parents.  A  more  informative  structure  would  require  each  parent  to 
know  the  identity  of  its  children;  this  is  easy  to  achieve  on  undirected 
digraphs. 

(ii)  The  BFS  tree  computation  has  the  same  lower  bounds  as  the  broad¬ 
cast  problem. 

An  elegant  and  classic  solution  to  the  broadcast  and  BFS  tree  compu¬ 
tation  problems  is  given  by  the  flooding  algorithm.  This  algorithm 
implements  the  same  “breadth-first  search”  mechanism  of  the  (centralized) 
BFS  algorithm  characterized  in  Lemma  1.25. 

[Informal  description]  The  source  broadcasts  the  token  to  its 
out-neighbors.  At  each  communication  round,  each  node  deter¬ 
mines  whether  it  has  received  a  non-null  message  from  one  of 
its  in-neighbors.  When  a  non-null  message  is  received,  i.e.,  the 
token  is  received,  the  node  performs  two  actions.  First,  the  node 
stores  the  token  in  the  variable  data  (this  solves  the  Broad¬ 
cast  problem).  Second,  the  node  stores  the  identity  of  one  of 
the  transmitting  in-neighbors  in  the  variable  parent  (this  solves 
the  BFS  tree  computation  problem).  Specifically,  if  the  message 
is  received  simultaneously  from  multiple  in-neighbors,  then  the 
node  stores  the  smallest  among  the  identities  of  the  transmitting 
in-neighbors.  At  the  subsequent  communication  round,  the  node 
broadcasts  the  token  to  its  out-neighbors. 

To  formally  describe  the  algorithm,  we  assume  that  the  node  with  the 
message  to  be  broadcast  is  v  =  1.  Also,  we  assume  that  the  token  is  a  letter 
of  the  Greek  alphabet  {a, . . .  ,lo}. 

Synchronous  Network:  S  =  ({1, . . . ,  n},  Ecmm) 

Distributed  Algorithm:  FLOODING 
Alphabet:  A  =  {a, . . . ,  u}  U  null 

Processor  State:  w  =  (parent,  data,  snd-f  lag),  where 
parent  S  {1, . . . ,  n}  U  null,  initially:  parent^  =  1, 

parent  W  =  null  for  all  j  y  1 
data  E  A,  initially:  data^  =  fi, 

data^  =  null  for  all  j  /  1 

snd-f  lag  £  {false,  true},  initially:  snd-f  lag!1!  =  true, 

snd-f lag^  =  false  for  all  j  /  1 
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function  msg('u>,  i) 

i:  if  (parent  ^  i )  AND  (snd-f  lag  =  true)  then 
2:  return  data 

3:  else 

4:  return  null 

function  stf (w,y) 
i:  case 

2:  (data  =  null)  AND  (y  contains  only  null  messages): 

%  The  node  has  not  yet  received  the  token 

3:  new-parent  :=  null 

4:  new-data  :=  null 

5:  new-snd-flag  :=  false 

6:  (data  =  null)  AND  ( y  contains  a  non-null  message): 

%  The  node  has  just  received  the  token 
7:  new-parent  :=  smallest  UID  among  transmitting  in- neighbors 

8:  new-data  :=  a  non-null  message 

9:  new-snd-flag  :=  true 

10:  (data  ^  null): 

%  If  the  node  already  has  the  token,  then  do  not  re-broadcast  it 

li:  new-parent  :=  parent 

12:  new-data  :=  data 

13:  new-snd-flag  :=  false 

14:  return  (new-parent, new-data, new-snd-flag) 


An  execution  of  the  flooding  algorithm  is  illustrated  in  Figure  1.15. 


Figure  1.15  Example  execution  of  the  flooding  algorithm.  The  source  is  vertex  1.  (a) 
shows  the  network  and  (b)  shows  the  BFS  tree  that  results  from  the  execution. 

This  algorithm  can  analyzed  by  induction:  one  can  show  that,  for  d  £ 
{1, ...,  radius  (v,  5)},  every  node  at  a  distance  d  from  the  root  receives  a 
non-null  message  at  round  d.  A  summary  of  results  is  given  as  follows. 
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Lemma  1.50  (Complexity  upper  bounds  for  the  flooding  algo¬ 
rithm).  For  a  network  S  containing  a  spanning  tree  rooted  at  v,  the  flood¬ 
ing  algorithm  has  communication  complexity  in  0(|-Ecmm|),  time  com¬ 
plexity  in  0(radius(u,  <S)),  and  space  complexity  in  0(1). 

Remark  1.51  (Termination  condition  for  the  flooding  algorithm). 

As  presented,  the  flooding  algorithm  does  not  include  a  termination  con¬ 
dition,  i.e.,  the  processors  do  not  have  a  mechanism  to  detect  when  the 
broadcast  and  tree  computation  are  complete.  If  an  upper  bound  on  the 
graph  diameter  is  known,  then  it  is  easy  to  design  a  termination  condition 
based  on  this  information;  we  do  this  in  the  next  subsection.  If  no  a  priori 
knowledge  is  available,  then  one  can  design  more  sophisticated  algorithms 
for  networks  with  stronger  connectivity  properties.  We  refer  to  [Lynch,  1997, 
Peleg,  2000]  for  a  complete  discussion  about  this.  • 


1.5.4  Leader  election 

Next,  we  formulate  another  interesting  problem  for  a  network. 

Problem  1.52  (Leader  election).  Assume  that  all  processors  of  a  net¬ 
work  have  a  state  variable,  say  leader,  initially  set  to  unknown.  We  say  that 
a  leader  is  elected  when  one  and  only  one  processor  has  the  state  variable 
set  to  true  and  all  others  have  it  set  to  false.  Elect  a  leader.  • 

This  is  a  task  that  is  a  bit  more  global  in  nature.  We  display  here  a 
solution  that  requires  individual  processors  to  know  the  diameter  of  the 
network,  denoted  by  diam(5),  or  an  upper  bound  on  it. 


[Informal  description]  At  each  communication  round,  each  agent 
sends  to  all  its  neighbors  the  maximum  UID  it  has  received  up 
to  that  time.  This  is  repeated  for  diam(5)  rounds.  At  the  last 
round,  each  agent  compares  the  maximum  received  UID  with  its 
own,  and  declares  itself  a  leader  if  they  coincide,  or  a  non-leader 
otherwise. 


The  algorithm  is  called  the  floodmax  algorithm:  the  maximum  UID  in 
the  network  is  transmitted  to  other  agents  in  an  incremental  fashion.  At 
the  first  communication  round,  agents  that  are  neighbors  of  the  agent  with 
the  maximum  UID  receive  the  message  from  it.  At  the  next  communication 
round,  the  neighbors  of  these  agents  receive  the  message  with  the  maximum 
UID.  This  process  goes  on  for  diam(5)  rounds  to  ensure  that  every  agent 
receives  the  maximum  UID.  Note  that  there  are  networks  for  which  all  agents 
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receive  the  message  with  the  maximum  UID  in  fewer  communication  rounds 
than  diam(5).  The  algorithm  is  formally  stated  as  follows. 


Synchronous  Network:  S  =  ({1, . . . ,  n},  Ecmm) 

Distributed  Algorithm:  FLOODMAX 
Alphabet:  A  =  {1, . . . ,  n}  U{null} 

Processor  State:  w  =  (my-id,  max-id,  leader,  round),  where 


my-id 

G  {1, . . .  ,n}, 

initially: 

my-idW  =  i  for  all  i 

max- id 

£{!,■•  •  ,n}, 

initially: 

max-id^  =  i  for  all  i 

leader 

G  {false, true, unknown}, 

initially: 

leader^  =  unknown  for  all 

round 

G  {0, 1, . . . ,  diam(S)}, 

initially: 

roundel  =  0  for  all  i 

function  msg (w,i) 
i:  if  round  <  diam(5)  then 
2:  return  max-id 

3:  else 

4:  return  null 

function  stf (w,y) 

i:  new-id:=  maxjmax- id,  largest  identifier  iny} 

2:  case 

3:  round  <  diam(5):  new-lead  :=  unknown 

4:  round  =  diam(5)  AND  max-id  =  my-id:  new-lead  :=  true 

5:  round  =  diam(5)  AND  max-id  >  my-id:  new-lead  :=  false 

6:  return  (my-id,  new-id,  new-lead,  round  +1) 


Figure  1.16  shows  an  execution  of  the  FLOODMAX  algorithm.  This  al¬ 
gorithm’s  properties  are  characterized  in  the  following  lemma.  A  complete 
analysis  of  this  algorithm,  including  modifications  to  improve  the  commu¬ 
nication  complexity,  is  discussed  in  [Lynch,  1997,  Section  4.1]. 


Figure  1.16  Execution  of  the  FLOODMAX  ALGORITHM.  The  diameter  of  the  network  is  4. 

In  the  leftmost  frame,  the  agent  with  the  maximum  UID  is  colored  in  red. 
After  4  communication  rounds,  its  message  has  been  received  by  all  agents. 


Lemma  1.53  (Complexity  upper  bounds  for  the  floodmax  algo¬ 
rithm).  For  a  network  S  containing  a  spanning  tree,  the  floodmax  al- 
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GORITHM  has  communication  complexity  in  0(diam(5)|£'cmm|)J  time  com¬ 
plexity  equal  to  diam(5),  and  space  complexity  in  0(1). 

A  simplification  of  the  FLOODMAX  algorithm  leads  to  the  Le  Lann- 
Chang- Roberts  algorithm  (or  LCR  algorithm)  for  leader  election  in  rings, 
see  [Lynch,  1997,  Chapter  3.3],  that  we  describe  next.  The  LCR  algorithm 
runs  on  a  ring  digraph  and  does  not  require  the  agents  to  know  the  diameter 
of  the  network. 


[Informal  description]  At  each  communication  round,  if  the  agent 
receives  from  its  in-neighbor  a  UID  that  is  larger  than  the  UIDs 
received  earlier,  then  the  agent  records  the  received  UID  and 
forwards  it  to  the  out-neighbor  during  the  following  communica¬ 
tion  round.  (Agents  do  not  record  the  number  of  communication 
rounds.)  When  the  agent  with  the  maximum  UID  receives  its 
own  UID  from  a  neighbor,  it  declares  itself  the  leader. 


The  algorithm  is  formally  stated  as  follows. 


Synchronous  Network:  ring  digraph 


Distributed  Algorithm:  LCR 
Alphabet:  A  =  {1, . . . ,  n}  U{null} 


Processor  State:  w  =  (my-id,  max-id,  leader,  snd-f  lag),  where 


my-id  E  {1, . . . ,  n},  initially: 

max-id  E  {1, . . . ,  n},  initially: 

leader  E  {true,  false,  unknown},  initially: 

snd-f  lag  E  {true,  false},  initially: 


my-idW  =  i  for  all  i 
max-idW  =  i  for  all  i 
leader^  =  unknown  for  all 
snd-f  lag!*!  =  true  for  all  i 


function  msg (w,i) 
l:  if  snd-f  lag  =  true  then 
2:  return  max-id 

3:  else 

4:  return  null 

function  stf (w,y) 
i:  case 

2:  ( y  contains  only  null  msgs)  OR  (largest  identifier  in y  <  my-id): 

3:  new-id  :=  max-id 

4:  new-lead  :=  leader 

5:  new-snd-f lag  :=  false 

6:  (largest  identifier  in y  =  my-id): 
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7:  new-id  :=  max-id 

8:  new-lead  :=  true 

9:  new-snd-flag  :=  false 

10:  (largest  identifier  iny  >  my-id): 

li:  new-id  :=  largest  identifier  in y 

12:  new-lead  :=  false 

13:  new-snd-flag  :=  true 

14:  return  (my-id,  new-id,  new-lead,  new-snd-flag) 


Figure  1.17  shows  an  execution  of  the  LCR  algorithm.  The  properties 


Figure  1.17  Execution  of  the  LCR  ALGORITHM.  In  the  leftmost  frame,  the  agent  with  the 
maximum  UID  is  colored  in  red.  After  5  communication  rounds,  this  agent 
receives  its  own  UID  from  its  in-neighbor  and  declares  itself  the  leader. 


of  the  LCR  ALGORITHM  can  be  characterized  as  follows. 

Lemma  1.54  (Complexity  upper  bounds  for  the  LCR  algorithm). 

For  a  ring  network  S  of  order  n,  the  LCR  algorithm  has  communication 
complexity  in  0(n2),  time  complexity  equal  to  n,  and  space  complexity  in 
0(1). 


1.5.5  Shortest-paths  tree  computation 

Finally,  we  consider  the  shortest-paths  tree  problem  in  a  weighted  digraph: 
in  Section  1.4.4  we  presented  Dijkstra’s  algorithm  to  solve  this  problem  in 
a  centralized  setting;  we  present  here  the  Bellman-Ford  algorithm  for  the 
distributed  setting. 

We  consider  a  synchronous  network  associated  to  a  weighted  digraph,  i.e., 
we  assume  that  a  strictly  positive  weight  is  associated  to  each  communication 
edge.  We  assume  that  the  source  is  vertex  1  and  we  aim  to  compute  a 
tree  containing  shortest  paths  from  node  1  to  all  other  nodes.  As  for  the 
computation  of  a  BFS  tree,  we  aim  to  obtain  a  distributed  representation 
of  a  directed  tree  with  bounded  memory  at  each  node. 


[Informal  description]  Each  agent  maintains  (1)  an  estimate  dist 
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of  its  weighted  distance  from  the  source,  and  (2)  an  estimate 
parent  of  the  in-neighbor  corresponding  to  the  (weighted)  short¬ 
est  path  from  the  source.  The  dist  estimate  is  initialized  to  0 
for  the  source  and  to  +oo  for  all  other  nodes.  At  each  com¬ 
munication  round,  each  agent  performs  the  following  tasks:  (1) 
it  transmits  its  dist  to  its  out-neighbors,  (2)  it  computes  the 
smallest  quantity  among  “the  dist  received  from  an  in- neighbor 
summed  with  the  edge  weight  corresponding  to  that  same  in¬ 
neighbor,”  and  (3)  if  the  agent’s  estimate  dist  is  larger  than 
this  quantity,  then  the  agent  updates  its  dist  and  its  estimate 
parent. 


The  algorithm  is  formally  stated  as  follows. 


Synchronous  Network  with  Weights:  S  =  ({1, . . . ,  n},  Ecmm,  A) 
Distributed  Algorithm:  DISTRIBUTED  BELLMAN-FORD 
Alphabet:  A  =  M>o  U null  U{+oo} 

Processor  State:  w  =  (parent,  dist),  where 

parent  G  {1, . . .  ,n},  initially:  parent^  =  j  for  all  j 

dist  G  A,  initially:  dataM  =  0, 

data^l  =  Too  for  all  j  ^  1 

function  msg(w;,  i) 

l:  if  round  <  n  then 
2:  return  dist 

3:  else 

4:  return  null 

function  stf (w,y) 
i:  i  :=  processor  UID 
2:  k  :=  arginfjyj  T  a,ji  |  for  all  yj  y  null} 

3:  if  (dist  <  k)  then 
4:  return  (parent,  dist) 

5:  else 

6:  return  (k,  yk  T  aki ) 


In  other  words,  if  we  let  di  G  R>o  U{Too}  denote  the  dist  variable  for  each 
processor  i.  then  the  Bellman-Ford  algorithm  is  equivalent  to  the  following 
discrete-time  dynamical  system: 

di(£  T  1)  =  inf  {di(tj ,  inf {dj(£)  +  a ^  \  ( j,i )  G  £cmm}}, 
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with  initial  conditions  d{ 0)  =  (1,  +oo, . . . ,  +oo).  (Recall  that  Ec mm  is  the 
edge  set  and  the  weights  atJ  are  strictly  positive  for  all  (i,j)  G  Ec mm.) 

This  algorithm’s  key  property  enabling  its  analysis  is  that,  after  k  com¬ 
munication  rounds,  the  estimated  distance  at  node  i  equals  the  shortest 
path  of  topological  length  at  most  k  from  the  source  to  node  i.  Therefore, 
after  n  —  1  communication  rounds,  all  possible  distinct  topological  paths 
connecting  source  to  node  i  have  been  investigated.  The  properties  of  the 
DISTRIBUTED  BELLMAN-FORD  ALGORITHM  as  follows. 

Lemma  1.55  (Complexity  upper  bounds  for  the  distributed  Bell- 
man-Ford  algorithm).  For  a  networks  of  order  n  containing  a  spanning 
tree  rooted  at  v,  the  distributed  Bellman-Ford  algorithm  has  com¬ 
munication  complexity  in  0(n|E'cmm|),  time  complexity  equal  to  n  —  1,  and 
space  complexity  in  0(1). 

Figure  1.18  shows  an  execution  of  the  distributed  Bellman-Ford  al¬ 
gorithm. 

In  the  leftmost  frame,  the  agent  with  the  maximum  UID  is  colored  in 
red.  After  5  communication  rounds,  this  agent  receives  its  own  UID  from 
its  in- neighbor  and  declares  itself  the  leader. 

1.6  LINEAR  DISTRIBUTED  ALGORITHMS 

Computing  linear  combination  of  the  initial  states  of  the  processors  is  one 
of  the  most  basic  computation  that  we  might  be  interested  in  implementing 
on  a  synchronous  network.  More  accurately,  linear  distributed  algorithms 
on  synchronous  networks  are  discrete-time  linear  dynamical  systems  whose 
evolution  map  is  linear  and  has  a  sparsity  structure  related  to  the  network. 
These  algorithms  represent  an  important  class  of  iterative  algorithms  that 
find  applications  in  optimization,  in  the  solution  of  systems  of  equations  and 
in  distributed  decision  making,  see  for  instance  [Bertsekas  and  Tsitsiklis, 
1997].  In  this  section  we  present  some  relevant  results  on  distributed  linear 
algorithms. 


1.6.1  Linear  iterations  on  synchronous  networks 

Given  a  synchronous  network  S  =  ({1, . . . ,  n},£cmm),  assign  a  scalar  fji  ^  0 
to  each  directed  edge  (i,  j)  G  Ecmm.  Given  such  scalars  fji  for  (i,j)  G  Ecmm, 
the  Linear  combination  algorithm  over  S  is  defined  as  follows. 


Distributed  Algorithm:  Linear  combination 
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Figure  1.18  Execution  of  the  distributed  Bellman-Ford  algorithm,  (a)  depicts  the 
processor  state  initialization.  The  vertex  1  is  the  only  one  whose  variable 
dist  is  0.  After  3  iterations,  as  guaranteed  by  Lemma  1.55,  (d)  depicts  the 
resulting  shortest-paths  tree  of  the  digraph  rooted  at  vertex  1.  This  tree  is 
represented  in  the  last  frame  with  edges  colored  in  gray. 
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Alphabet:  A  =  KUnull 
Processor  state:  weR 

function  msg (w,i)  =  msgstd(u;,i) 

function  stf (w,y) 
i:  i  :=  processor  UID 
2:  return  fiiW  +  EjeA^(i)  foVi 


We  assume  that  each  processor  i  G  { 1 , . . . ,  n}  knows  the  scalars  fij ,  for 
j  G  A fm(i)  U  {i},  so  that  it  can  evaluate  the  state-transition  function.  Also, 
we  assume  that  real  numbers  may  be  transmitted  through  a  communica¬ 
tion  channel,  i.e. ,  we  neglect  quantization  issues  in  the  message-generation 
function. 

In  the  language  of  Section  1.3,  one  can  regard  the  Linear  combination 
algorithm  over  S  as  the  discrete-time  continuous-space  dynamical  system 
(X,Xq ,/),  with  X  =  Xq  =  M”  and  evolution  map  defined  by  f(w)  = 
F  ■  w,  where  we  define  a  matrix  F  G  Wnxn  with  vanishing  entries  except 
for  fji,  for  (i,  j)  G  Ecmm.  Note  that,  if  A(S)  denotes  the  adjacency  matrix 
of  the  digraph  S ,  then  the  entries  of  F  vanish  precisely  when  the  entries  of 
A(5)r  vanish.  With  this  notation,  the  evolution  w  :  Z>o  — ►  Mn  with  initial 
condition  wo  G  Mn  is  given  by 

w(0)  =  wo,  w(£  +  1)  =  F  ■  w(£),  i  G  Z>q.  (1.6.1) 


Conversely,  any  linear  algorithm  of  the  form  (1.6.1)  can  easily  be  cast  as 
a  Linear  combination  algorithm  over  a  suitable  synchronous  network. 
We  do  this  bookkeeping  carefully,  in  order  to  be  consistent  with  the  notion 
of  associated  weighted  digraph  from  Remark  1.33.  Given  F  G  MTlxn,  let 
Sf  be  the  synchronous  network  with  node  set  {1, . . . ,  n}  and  with  edge  set 
Ecmm(F)  defined  by  any  of  the  equivalent  statements. 

(i)  (i,j)  G  Ecmm(F)  if  and  only  if  fji  /  0;  or 

(ii)  Sf  is  the  reversed  and  unweighted  version  of  the  digraph  associated 
to  F. 


1.6.2  Averaging  algorithms 

In  what  follows  we  consider  linear  combination  algorithms  over  time-dependent 
weighted  directed  graphs;  we  restrict  our  analysis  to  nonnegative  weights. 
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Definition  1.56  (Averaging  algorithms).  The  averaging  algorithm  as¬ 
sociated  to  a  sequence  of  stochastic  matrices  {F{1)  \  t  6  Z>o}  C  Mnxn  is 
the  discrete-time  dynamical  system 

w(£+l)  =  F(£) -w(£),  fGZ>0.  (1.6.2) 


In  the  literature,  such  algorithms  are  often  referred  to  as  agreement  algo¬ 
rithms  or  as  consensus  algorithms. 

There  are  useful  ways  to  compute  a  stochastic  matrix,  and  therefore,  a 
time-independent  averaging  algorithm,  from  a  weighted  digraph;  see  Exer¬ 
cise  El. 14. 

Definition  1.57  (Adjacency-  and  Laplacian-based  averaging).  Let  G 

be  a  weighted  digraph  with  node  set  {1, . . . ,  n},  weighted  adjacency  matrix 
A,  weighted  out-degree  matrix  Dout,  and  weighted  Laplacian  L.  Then 

(i)  the  adjacency-based,  averaging  algorithm  is  defined  by  the  stochastic 
matrix  ( In  +  Dout)_1(/n  +  A)  and  reads  in  components 

1  n 

Wi{t  +  1)  =  — — 7 — -pr{wi(l)  +  ^2  OijWj  (£))■,  (1.6.3) 

j-  +  Uoutl^J  ^ 

(ii)  given  a  positive  scalar  e  upper  bounded  by  min{l/<iout(z)  |  i  e 
{l,...,n}},  the  Laplacian-based  averaging  algorithm  is  defined  by 
the  stochastic  matrix  In  —  eL{G )  and  reads  in  components 

n  n 

Wi{t  +  l)  =  (l-£  22  aij^)wi(£)  +  e  22  aijwj{£)-  (1-6.4) 

j=\, j^i  3=1,  j^i 

These  notions  are  immediately  extended  to  sequences  of  stochastic  matrices 
arising  from  sequences  of  weighted  digraphs.  • 

Adjacency-based  averaging  algorithms  arising  from  unweighted  (undirected) 
graphs  without  self-loops  are  also  known  as  the  equal-neighbor  averaging  rule 
or  the  Vicsek’s  model ;  see  [Vicsek  et  ah,  1995].  Specifically,  if  G  is  an  un¬ 
weighted  graph  with  vertices  {l,...,n}  and  without  self-loops,  then  the 
equal-neighbor  averaging  rule  is 

Wi(£+1)  =  |  j  GJVgW}),  (1.6.5) 

where  we  adopt  the  shorthand  avrg({x'i , . . . ,  Xk })  =  {x\  +  •  •  •  +  Xk)/k. 

Remark  1.58  (Sensing  versus  communication  interpretation  of  di¬ 
rected  edges).  In  the  definition  of  averaging  algorithms  arising  from  di- 
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graphs,  the  edges  of  the  digraph  play  the  role  of  “sensing  edges,”  not  that  of 
“communication  edges.”  In  other  words,  a  nonzero  entry  a%j ,  corresponding 
to  the  edge  (i,j)  in  the  digraph,  implies  that  the  zth  component  of  the  state 
is  updated  with  the  j th  component  of  the  state.  It  is  as  if  node  i  could  sense 
the  state  of  node  j,  rather  than  node  i  transmitting  to  node  j  its  own  state. 


Next,  we  present  the  main  stability  and  convergence  results  for  averag¬ 
ing  algorithms  associated  to  a  sequence  of  stochastic  matrices.  We  start 
by  discussing  equilibrium  points  and  their  stability.  Recall  that  ln  is  an 
eigenvector  of  any  stochastic  matrix  with  eigenvalue  1  and  that  the  diag¬ 
onal  set  diag(Mn)  is  the  vector  subspace  generated  by  ln.  Therefore,  any 
point  in  diag(Mn)  is  an  equilibrium  for  any  averaging  algorithm.  We  refer  to 
the  points  of  the  diag(Mn)  as  agreeement  configurations ,  since  all  the  com¬ 
ponents  of  an  element  in  diag(Rn)  are  equal  to  the  same  value.  We  will 
informally  say  that  an  algorithm  achieves  agreement  if  it  steers  the  network 
state  towards  the  set  of  agreement  configurations. 

Lemma  1.59  (Stability  of  agreement  configurations).  Any  averaging 
algorithm  in  Mn  is  uniformly  stable  and  uniformly  bounded  with  respect  to 
diag(Mn). 

Regarding  convergence  results,  we  need  to  introduce  a  useful  property  of 
collections  of  stochastic  matrices.  Given  a  G  ]0, 1],  the  set  of  non- degenerate 
matrices  with  respect  to  a  consists  of  all  stochastic  matrices  F  with  entries 
fij,  for  i,  j  G  {1, . . . ,  n},  satisfying 

fn  G  [a,  1],  and  fij  G  {0}  U[a,  1]  for  j  fi  i. 

Additionally,  the  sequence  of  stochastic  matrices  {F{1)  \  i  G  Z>o }  is  non¬ 
degenerate  if  there  exists  a  G  ] 0, 1]  such  that  F(£)  is  non- degenerate  with 
respect  to  a  for  all  l  G  Z>o-  We  now  state  the  main  convergence  resultand 
postpone  its  proof  to  Section  1.8.5. 

Theorem  1.60  (Convergence  for  time-dependent  stochastic  matri¬ 
ces).  Let  {F(l)  |  t  G  Z>o }  C  Wixn  be  a  non- degenerate  sequence  of  stochas¬ 
tic  matrices.  For  i  G  Z>o,  let  G(£)  be  the  unweighted  digraph  associated  to 
F(l),  according  to  Remark  1.33.  The  following  statements  are  equivalent: 

(i)  The  set  diag(Mn)  is  uniformly  globally  attractive  for  the  associated 
averaging  algorithm;  and 

(ii)  there  exists  a  duration  5  G  N  such  that,  for  all  i  G  Z>o,  the  digraph 

G{£  +  1)  U  •  •  •  U  G{£  +  5) 
contains  a  globally  reachable  vertex. 
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We  collect  a  few  observations  about  this  result. 


Remarks  1.61  (Discussion  of  Theorem  1.60). 

(i)  The  statement  in  Theorem  1.60(i)  means  that  each  solution  to  the 
time-dependent  linear  dynamical  system  (1.6.2)  converges  uniformly 
and  asymptotically  to  the  vector  subspace  generated  by  ln. 

(ii)  The  necessary  and  sufficient  condition  in  Theorem  1.60(H)  amounts 
to  the  existence  of  a  uniformly-bounded  time  duration  6  with  the 
property  that  a  weak  connectivity  assumptions  holds  over  each  col¬ 
lection  of  6  consecutive  digraphs.  We  refer  to  [Blondel  et  al.,  2005] 
for  a  counterexample  showing  is  the  duration  in  Theorem  1.60  is 
not  uniformly  bounded,  then  there  exist  algorithms  that  do  not 
converge. 

(iii)  According  to  Definition  1.20,  uniform  convergence  is  a  property  of 
all  solutions  to  system  (1.6.2)  starting  at  any  arbitrary  time,  and 
not  only  at  time  equal  to  0.  If  we  restrict  our  attention  to  solutions 
that  only  start  at  time  0,  then  Theorem  1.60  should  be  modified  as 
follows:  the  statement  in  Theorem  1.60(i)  implies,  but  is  not  implied 
by,  the  statement  in  Theorem  1.60(H). 

(iv)  The  theorem  applies  only  to  sequences  of  non-degenerate  matri¬ 

ces.  Indeed,  there  exist  sequences  of  degenerate  stochastic  matrices 
whose  associated  averaging  algorithms  converges.  Furthermore,  one 
does  not  even  need  to  consider  sequences  because  it  is  possible  to 
define  converging  algorithms  by  just  considering  a  single  stochastic 
matrix.  Precisely  when  the  stochastic  matrix  is  primitive  we  already 
know  that  the  associated  averaging  algorithm  will  converge  (see 
Theorem  1.11).  Examples  of  degenerate  primitive  stochastic  ma¬ 
trices  (with  converging  associated  averaging  algorithms)  are  given 
in  Exercise  El. 22.  We  discuss  time-invariant  averaging  algorithms 
in  Proposition  1.65  below.  • 

Theorem  1.60  gives  a  general  result  about  non-degenerate  stochastic  ma¬ 
trices  that  are  not  necessarily  symmetric.  The  following  theorem  presents 
a  convergence  result  for  the  case  of  symmetric  matrices  (i.e. ,  undirected 
digraphs)  under  connectivity  requirements  that  are  weaker  (i.e.,  the  dura¬ 
tion  does  not  need  to  be  uniformly  bounded)  than  the  ones  expressed  in 
statement  (ii)  of  Theorem  1.60. 

Theorem  1.62  (Convergence  for  time-dependent  symmetric  stochas¬ 
tic  matrices).  Let  {F{£)  \  i  £  Z>o}  C  Mnxn  be  a  non-degenerate  sequence 
of  symmetric,  stochastic  matrices.  For  £  £  Z>o,  let  G{£)  be  the  unweighted 
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graph  associated  to  F(£),  according  to  Remark  1.33.  The  following  state¬ 
ments  are  equivalent: 

(i)  The  set  diag(Mn)  is  globally  attractive  for  the  associated  averaging 
algorithm;  and 

(ii)  for  all  £  G  Z>o,  the  graph 

r>£ 

is  connected. 

Let  us  briefly  particularize  our  discussion  here  on  adjacency-  and  Laplacian- 
based  averaging  algorithms. 

Corollary  1.63  (Convergence  of  adjancency-  and  Laplacian-based 
averaging  algorithms).  Let  {G(£)  \  £  G  Z>o}  C  Rnxn  be  a  sequence  of 
weighted  digraphs.  The  following  statements  are  equivalent: 

(i)  There  exists  5  G  N  such  that,  for  all  £  G  Z> o,  the  digraph 

G(£  +  1)  U  •  •  •  U  G{£  +  d) 
contains  a  globally  reachable  vertex; 

(ii)  the  set  diag(Mn)  is  uniformly  globally  attractive  for  the  the  time- 
dependent  adjancency-based  averaging  algorithm  (1.6.3);  and 

(in)  the  set  diag(Mn)  is  uniformly  globally  attractive  for  the  the  time- 
dependent  Laplacian-based  averaging  algorithm  (1.6.4)  (defined  with 
e  <  1/n). 

Finally,  we  refine  the  results  presented  thus  far  by  discussing  some  further 
aspects. 

Proposition  1.64  (Convergence  to  a  point  in  the  invariant  set). 

Under  the  assumptions  in  Theorem  1.60  and  assuming  that  diag(Mn)  is  uni¬ 
formly  globally  attractive  for  the  averaging  algorithm,  each  individual  evolu¬ 
tion  converges  to  a  specific  point  o/diag(Mn),  rather  than  converging  to  the 
whole  set. 

In  general,  the  specific  value  upon  which  all  Wi,  i  G  {1, . . .  ,n},  agree  in 
the  limit  is  unknown.  Clearly,  this  agreement  value  depends  on  the  initial 
condition  and  the  specific  sequence  of  matrices  defining  the  time-dependent 
linear  algorithm.  In  some  cases,  however,  by  restricting  the  class  of  allowable 
matrices,  we  can  elucidate  the  common  limit  value.  We  consider  two  impor¬ 
tant  settings:  time-independent  averaging  algorithms  and  doubly-stochastic 
averaging  algorithms. 
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First,  we  specialize  the  main  convergence  result  to  the  case  of  time- 
independent  averaging  algorithms.  Note  that,  given  a  stochastic  matrix 
F,  convergence  of  the  averaging  algorithm  associated  to  F  for  all  initial 
conditions  is  equivalent  to  the  matrix  F  being  semi-convergent  (see  Defini¬ 
tion  1.7). 

Proposition  1.65  (Time-independent  averaging  algorithm).  Con¬ 
sider  the  linear  dynamical  system  on  Mn 

w(£  +  l)  =  Fw(i),  i£Z>0.  (1.6.6) 

Assume  F  £  Mnxr7,  is  stochastic,  let  G(F)  denote  its  associated  weighted 
digraph,  and  let  v  £  Mn  be  a  left  eigenvector  of  F  with  eigenvalue  1.  Assume 
either  one  of  the  two  following  properties: 

(i)  F  is  primitive  (i.e.,  G(F)  is  strongly  connected  and  aperiodic);  or 

(ii)  F  has  non-zero  diagonal  terms  and  a  column  of  Fn~x  has  positive 
entries  (i.e.,  G(F)  has  self-loops  at  each  node  and  has  a  globally 
reachable  node). 

Then  every  trajectory  w  of  system  (1.6.6)  converges  to  (vTw( 0)/u7  ln)ln. 


Proof.  From  Theorem  1.60  we  know  that  the  dynamical  system  (1.6.6)  con¬ 
verges  if  property  (ii)  holds.  The  same  conclusion  follows  if  F  satisfies 
property  (i)  because  of  Perron-Frobenius  Theorem  1.9  and  Lemma  1.8.  To 
computing  the  limiting  value,  note  that 

vTw(l  +  1)  =  vtFw(£)  =  vTw(t), 

that  is,  the  quantity  1 1— >  vTw(i)  is  constant.  Because  F  is  semi-convergent 
and  stochastic,  we  know  that  lim^_>+00  w{l)  =  aln  for  some  a.  To  conclude, 
we  compute  a  from  the  relationship  a(vT ln)  =  lim^_>+0O  vTw(£)  =  uTu;(0). 


Remarks  1.66  (Alternative  conditions  for  time-independent  aver¬ 
aging). 

(i)  The  following  necessary  and  sufficient  condition  generalizes  and  is 
weaker  than  the  two  sufficient  conditions  given  in  Proposition  1.65: 
every  trajectory  of  system  (1.6.6)  is  asymptotically  convergent  if  and 
only  if  all  sinks  of  the  condensation  digraph  of  G(F)  are  aperiodic 
subgraphs  of  G(F).  We  refer  the  interested  reader  to  [Meyer,  2001, 
Chapter  8]  for  the  proof  of  this  statement  and  for  the  related  notion 
of  ergodic  classes  of  a  Markov  chain.  Also,  we  refer  the  interested 
reader  to  Exercise  El.  12  for  the  notion  of  condensation  digraph. 
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(ii)  Without  introducing  any  trajectory  w,  the  result  of  the  proposition 
can  be  equivalently  stated  by  saying  that 

lim  Fl  =  (vTln)~1lnvT .  • 

e->+oo 


Second,  we  specialize  the  main  convergence  result  to  the  case  of  doubly 
stochastic  averaging  algorithms. 

Corollary  1.67  (Average  consensus).  Let  {F{£)  \  £  E  Z>o }  be  a  se¬ 
quence  of  stochastic  matrices  as  in  Theorem  1.60.  If  all  matrices  F(£), 
£  E  Z>o,  are  doubly  stochastic,  then  every  trajectory  w  of  the  averaging 
algorithms  satisfies 

n  n 

£«)  =  £  Wi  (0) ,  for  all  £ , 

2=1  i— 1 

that  is,  the  sum  of  the  initial  conditions  is  a  conserved  quantity.  There¬ 
fore,  if  {F(£)  |  £  E  Z>o}  is  non- degenerate  and  satisfies  property  (ii)  in 
Theorem  1.60,  then 

1  n 

lim  Wj(£)  =  -V'wi(O),  j  E  {l,...,n}. 

£^+oa  n 


Proof.  The  proof  of  the  first  fact  is  an  immediate  consequence  of 

n  n 

^2  Wi{£  +  1)  =  llw{£  +  1)  =  1  nF(£)w(£)  =  1  lw{£)  =  J2 
i= 1  2—1 

The  second  fact  is  an  immediate  consequence  of  the  first  fact.  ■ 


In  other  words,  if  the  matrices  are  double  stochastic,  then  each  compo¬ 
nent  of  the  trajectories  will  converge  to  the  average  of  the  initial  condition. 
We  therefore  adopt  the  following  definition:  an  average- consensus  averaging 
algorithm  is  an  averaging  algorithm  whose  sequence  of  stochastic  matrices 
are  all  doubly  stochastic. 


1.6.3  Convergence  speed  of  averaging  algorithms 

We  know  that  any  trajectory  of  the  associated  averaging  algorithm  converges 
to  the  diagonal  set  diag(Mn);  in  what  follows  we  characterize  how  fast  this 
convergence  takes  place.  We  begin  with  some  general  definitions  for  semi- 
convergent  matrices  (recall  the  discussion  culminating  in  Lemma  1.8). 
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Definition  1.68  (Convergence  time  and  exponential  convergence 
factor).  Let  A  E  Mnxn  be  semi-convergent  with  limit  lim^+00  Ae  =  A*. 


(i)  For  e  E  ] 0, 1  [,  the  e-convergence  time  of  A  is  the  smallest  time 
Te(A)  E  Z>o  such  that,  for  all  xo  E  M"1  and  £  >  Te{A), 

\\Aex0  -  A*x0||2  <  e||x0  -  A*xo  ||  2 • 

(ii)  The  exponential  convergence  factor  of  A,  denoted  by  rexp(A)  E 
[0, 1[,  is 


f’exp  (A) 


sup 

xq^A-xo 


lim  sup 

•£— >+00 


/ \\A1xq  -  A*xq||2\ 
V  ||xq  -  A*Xq\\2  / 


The  exponential  factor  of  convergence  has  the  following  interpretation:  If 
the  trajectory  x{£)  =  A^x o  maximizing  the  sup  operator  has  the  form  x{£)  = 
/r(xo  —  x*)  +  x*,  for  p  <  1,  then  it  is  immediate  to  see  that  rexp(A)  =  p. 

Lemma  1.69  (Exponential  convergence  factor  of  a  convergent  ma¬ 
trix).  If  A  is  a  convergent  matrix,  then  rexp(A)  =  p(A). 

In  what  follows  we  are  interested  in  studying  how  the  convergence  time 
and  exponential  convergence  factor  of  a  matrix  depend  upon  e  and  upon  the 
dimension  of  the  matrix  itself. 


Remark  1.70  (Complexity  notions).  Analogously  to  the  treatment  in 
Section  1.5.2,  we  introduce  some  complexity  notions.  Let  An  E  Mnxn,  n  E  N, 
be  a  sequence  of  semi-convergent  matrices  with  limit  lim^+oo  An  =  Ko  and 
let  e  E  ]0, 1].  We  say  that 

(i)  Te{An)  is  of  order  Q(f(n,e))  if,  for  all  n  and  all  e,  there  exists  an 
initial  condition  xq  E  Mn  such  that  ||  A^xq  —  A*xo||2  >  e||xo  —  A*xo||2 
for  all  times  t  greater  than  a  constant  factor  times  /(n,  e); 

(ii)  Te(An)  is  of  order  0{f{n,£ ))  if,  for  all  n  and  all  e,  Te(An)  is  less 
than  or  equal  to  a  constant  factor  times  /(n,  e);  and 

(iii)  Te{An)  is  of  order  Q(f(n,e))  if  it  is  both  of  order  fl(/(n,  e))  and  of 

order  0(/(n,  e)).  • 


Lemma  1.71  (Asymptotic  relationship).  Let  An  E  M.nxn,  n  E  N,  be 
a  sequence  of  semi- convergent  matrices  and  let  e  E  ]0, 1] .  In  the  limit  as 
£  — »  0+  and  as  n  — >  +oo, 


T£(An)  E  O 


( - 

'  1  —  1"exp{An) 


loge  1 


Proof.  By  definition  of  exponential  convergence  factor  and  of  limsup,  we 
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know  that  for  all  77  >  0,  there  exists  N  such  that  for  all  £  >  N, 

||^X0  -  vl*Xo||2  <  OexpOln)  +  r]Y\\x0  ~  -4*^0  ||-2- 

The  e-convergence  time  is  upper  bounded  by  any  £  such  that  ( rexp(An )  + 
r)Y  <  e.  Selecting  rj  =  (1  —  rexp(An)) /2,  straightforward  manipulations  lead 
to 

£  > - log  e^1 

-  -log((rexp(4n)  +  l)/2)  g  ' 

It  is  also  immediate  to  note  that  >  _  iog((r+i)/2)  >  f°r  r  C  ]0, 1[.  This 
establishes  the  bound  in  the  statement  above.  ■ 


Next,  we  apply  the  notion  of  convergence  time  and  exponential  conver¬ 
gence  factor  to  any  non-degenerate  stochastic  matrix  whose  associated  di¬ 
graph  has  a  globally  reachable  node. 

Lemma  1.72  (Exponential  convergence  factor  of  stochastic  matri¬ 
ces).  Let  F  be  a  non- degenerate  stochastic  matrix  whose  associated  digraph 
has  a  globally  reachable  node.  Then 

^exp(T)  =  pess(E). 

(From  equation  (1.2.1)  recall  pess(F)  =  max{||A||c  |  A  G  spec(F)  \  {1}}J 


Proof.  If  v  G  Mn  is  a  left  eigenvector  of  F,  then,  as  in  Proposition  1.65, 

lim  F£  =  F*  =  (vT ln)-llnvT. 

1^  +  00 

Relaying  upon  vT F  =  vT  and  Fln  =  ln,  straightforward  manipulations 
show  that  F*  =  F*F  =  FF*  =  F*F*  and  in  turn 


pe+1  _  p*  =  (F  —  F*){Ft  —  F*). 

For  any  wq  G  Mn  such  that  wq  F*wq,  define  the  error  variable  e(£)  :  = 
F^wq  —  F*wq.  Note  that  the  error  variable  evolves  according  to  e(£  +  1)  = 
(F  —  F*)e(£ )  and  converges  to  zero.  Additionally,  the  rate  at  which  w(£)  = 
F£w o  converges  to  F*wq  is  the  same  at  which  e(£)  converges  to  zero,  that 
is, 

rexp(F  —  F*)  =  rexp(F). 


Therefore, 


rexp(E)  =  rexp(F  -  F*)  =  p{F  -  F*)  =  pess(F). 
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The  following  result  establishes  bounds  on  convergence  factors  and  conver¬ 
gence  times  for  stochastic  matrices  arising  from  the  equal- neighbor  averaging 
rule  in  equation  (1.6.5). 

Theorem  1.73  (Bounds  on  the  convergence  factor  and  the  conver¬ 
gence  time).  Let  G  be  an  undirected  unweighted  connected  graph  of  order 
n  and  let  e  E  ]0, 1].  Define  the  stochastic  matrix  F  =  (In  +  D(G))~1(In  + 
A(G)).  There  exists  7  >  0  (independent  of  n)  such  that  the  exponential 
convergence  factor  and  convergence  time  of  F  satisfy 

rexp{F)  <  1  -  7 n-3,  and  Te{F )  E  0(n3  log e_1), 
as  e  — *  0+  and  n  — >  +00. 


1.6.4  Algorithms  defined  by  tridiagonal  Toeplitz  and  tridiagonal  circulant 
matrices 


This  section  presents  a  detailed  analysis  of  the  convergence  rates  of  linear 
distributed  algorithms  defined  by  tridiagonal  Toeplitz  matrices  and  by  cer¬ 
tain  circulant  matrices.  Let  us  start  by  introducing  the  family  of  matrices 
under  study.  For  n  >  2  and  a,  b,  c  E  M,  define  the  n  x  n  matrices  Tridn(a,  b,  c) 
and  Circ n(a,b,c)  by 


and 


Trid  n(a,b,c) 


be  0  ...  0 

a  b  c  0 

0  . . .  a  b  c 

0  . . .  0  a  b 


Circn(a,  b ,  c)  =  Tridn(a,  b,  c )  + 


"o 

...  0 

a 

0 

...  0 

0 

0 

0 

...  0 

0 

c 

0 

...  0 

0 

We  call  the  matrices  Tridn  and  Circn  tridiagonal  Toeplitz  and  tridiagonal 
circulant,  respectively.  The  two  matrices  only  differ  in  their  (1,  n)  and  (n,  1) 
entries.  Note  our  convention  that 


Circ2(a,  b,  c ) 


b  a  +  c 
a  +  c  b 


Note  that,  for  a  =  0  and  c  /  0  (alternatively,  a/0  and  c  =  0),  the  syn- 
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chronous  networks  defined  by  Trid(a,  b,  c)  and  Circ(a,  b,  c)  are,  respectively, 
the  chain  and  the  ring  digraphs  introduced  in  Section  1.4.  If  both  a  and 
c  are  non- vanishing,  then  the  synchronous  networks  are,  respectively,  the 
undirected  versions  of  the  chain  and  the  ring  digraphs. 


Now,  we  characterize  the  eigenvalues  and  eigenvectors  of  Tridn  and  Circn. 


Lemma  1.74  (Eigenvalues  and  eigenvectors  of  tridiagonal  Toeplitz 
and  tridiagonal  circulant  matrices).  For  n  >  2  and  a,b,c  £  R,  the 
following  statements  hold: 


(i)  For  ac  0,  the  eigenvalues  and  eigenvectors  of  Tridn(a,  b,  c)  are, 
for  i  £  {1, . . .  ,n}, 


b  +  2c^  /  -  cos 


n r 

n  +  1 


£  C, 


(?) 

(?) 


1/2 

sin 

f  m  N 

,  n+i 

)) 

2/2 

sin 

'  2i-rt  ^ 

n/2 

sin 

nin 

[n+l/ 

V 

€C"; 


(ii)  the  eigenvalues  and  eigenvectors  o/Circn(a,  6,  c)  are,  for  co  =  exp(^(— 
and  for  i  £  {1, . . . ,  n}, 


i2ir\ 


i2ir\ 


b  +  (a  +  c)  cos  ^ - J  +  v —  1  (c  —  a)  sin  ^ - J  £  C, 

and  (l,cai,...,Jn-^i)T  £  Cn. 


Proof.  Both  facts  are  discussed,  for  example,  in  [Meyer,  2001,  Example  7.2.5 
and  Exercise  7.2.20].  Fact  (ii)  requires  some  straightforward  algebraic  ma¬ 
nipulations.  ■ 


Figure  1.19  illustrates  the  location  of  the  eigenvalues  of  these  matrices  in 
the  complex  plane. 


Remarks  1.75  (Inclusion  relationships  for  eigenvalues  of  tridiago¬ 
nal  Toeplitz  and  tridiagonal  circulant  matrices). 

(i)  The  set  of  eigenvalues  of  Tridn  (a,  b,  c)  is  contained  in  the  real  interval 
[b  —  2-^/ac,  b  +  2^/ac},  if  ac  >  0,  and  in  the  interval  in  the  complex 
plane  [6  —  2\f^l \f\ac\,  b  +  2v/-M\/|ac|],  if  ac  <  0. 

(ii)  The  set  of  eigenvalues  of  Circn(o,  b,  c )  is  contained  in  the  ellipse  on 

the  complex  plane  with  center  b,  horizontal  axis  2|a+c|,  and  vertical 
axis  2 1 c  —  a\.  • 
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Figure  1.19  The  eigenvalues  of  Toeplitz  and  circulant  matrices,  cf.  Lemma  1.74,  are 
closely  related  with  the  roots  of  unity.  Plotted  in  the  complex  plane,  the 
black  discs  correspond  in  (a)  to  the  eigenvalues  of  Tridi3(a,  b,  c),  and  in  (b) 
to  the  eigenvalues  of  CirciRO,  b,  c). 


Next,  we  characterize  the  convergence  rate  of  linear  algorithms  defined  by 
tridiagonal  Toeplitz  and  tridiagonal  circulant  matrices.  As  in  the  previous 
section  we  are  interested  in  asymptotic  results  as  the  system  dimension 
n  — >  Too  and  as  the  accuracy  parameter  e  goes  to  0+ . 

Theorem  1.76  (Linear  algorithms  defined  by  tridiagonal  Toeplitz 
and  tridiagonal  circulant  matrices).  Let  n  >  2,  e  E  ] 0, 1[,  and  a,b,c  € 
R.  Let  x  :  Z>o  — >  Rn  and  y  :  Z>q  — >  Rn  be  solutions  to 


x{£  T  1)  =  Tridn(a,  b,  c )  x(£),  y{l  T  1)  =  Circn(a,  b,  c)  y(£), 


with  initial  conditions  x(0)  =  xq  and  y( 0)  =  yo,  respectively.  The  following 
statements  hold: 

(i)  If  a  =  c  ^  0  and  \b\  T  2|a|  =  1,  then  lim^+oo  x(£)  =  0n  with 
e- convergence  time  in  0(n2loge_1); 

(ii)  if  a  /  0,  c  =  0  and  0  <  |6|  <  l,  then  lim^+00  x{£)  =  0n  with 
e-convergence  time  in  0{n\ogn  T  loge-1);  and 

(Hi)  if  a  >  0,  c  >  0,  1  >  b  >  0,  and  a  +  b  +  c  =  1,  then  lim^+00  y(£)  = 
yave In,  where  yave  =  with  £-convergence  time  in  0(n2  loge-1) . 

The  proof  of  this  result  is  reported  in  Section  1.8.6.  Next,  we  extend 
these  results  to  another  interesting  set  of  tridiagonal  matrices.  For  n  >  2 
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and  a,  b  E  R,  define  the  n  x  n  matrices  ATYid^"  (a,  b)  and  ATridn  (a,  b )  by 


ATrid^(a,  b )  =  Tridn(a,  6,  a)  ± 


We  refer  to  these  matrices  as  augmented  tridiagonal  matrices.  If  we  define 


a 

0  . 

o’ 

0 

0  . 

0 

0 

.  0 

0 

0 

.  0 

a 

P+  = 


1  1 

1  -1 

1  0 


0 

1 

-1 


P_  = 


0 

0 

1 

-1 

1 


(_l)n-2 


1 

1 

0 

0 

0 


0 

0 

1 

0 

0 

0 

1 

1 


-1 

0 

0 

0 

1 

0 

0 


0 

0 

0 

1 

-1 


1 

0 


then  the  following  similarity  transforms  are  satisfied: 


ATridn  (a,  6)  =  p± 


b±2a  0 

0  Tridn_i(a,  6,  a) 


o-l 


(1.6.7) 


To  analyze  the  convergence  properties  of  the  linear  algorithms  determined 
by  ATridn  (a,  and  ATrid”  (a,  b),  we  will  find  useful  to  consider  the  vector 

in-  =  (1,  -1, 1,  ■  ■  • ,  (~i)n-2,  (-i)n_1)T  e  R". 

In  the  following  theorem  we  will  not  assume  that  the  matrices  of  interest 
are  semi-convergent.  We  will  establish  convergence  to  a  trajectory,  rather 
than  to  a  fixed  point.  For  e  E  ]0, 1[,  we  say  that  a  trajectory  x  :  Z>o  — ► 
converges  to  xgnai  :  Z>o  — ►  Rn  with  convergence  time  Te  £  Z>o  if 

(i)  \\x{£)  —  £Cfinai (^) || 2  — ►  0  as  £  — >  +oo;  and 

(ii)  T£  is  the  smallest  time  such  that  \\x(£)  —  XfinaiCOIh  <  e||x(0)  — 
®finai(0)||2,  for  all  £  >  Te. 

Theorem  1.77  (Linear  algorithms  defined  by  augmented  tridiag¬ 
onal  matrices).  Let  n  >  2,  e  £  ]0, 1[,  and  a,  b  £  R  with  a  ^  0  and 
\b\  +  2 1 a|  =  1.  Let  x  :  Z>o  — >  Rn  and  z  :  Z>o  — ■»  Mn  be  solutions  to 

x(£  +  1)  =  ATridn  (a>  &)  x(p)i  Z(V-  +  1)  =  ATrid”  (a,  b)  z(£), 
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with  initial  conditions  x(0)  =  xo  and  z(0)  =  zq,  respectively.  The  following 
statements  hold: 

(i)  lim^+oo  (x(£)-xave(£)l n)  =  0„,  where  xave(£)  =  (^l^x0)(b+2a)1 , 
with  e-convergence  time  in  0(n2  loge”1);  and 

(a)  lim^+00  (z(£)  -  zave(£)ln-)  =  o n,  where  zave(£ )  =  (ylj^zo)^  - 
2 ay,  with  e-convergence  time  in  0(n2  logs-1) . 

The  proof  of  this  result  is  reported  in  Section  1.8.6. 

Remark  1.78  (From  Toeplitz  to  stochastic  matrices).  A  tridiagonal 
Toeplitz  matrix  is  not  stochastic  unless  its  off-diagonal  elements  are  zero. 
The  tridiagonal  circulant  matrices  Circn  and  augmented  tridiagonal  ma¬ 
trices  ATrid„  studied  in  Theorem  1.76(iii)  and  Theorem  1.77(i)  are  slight 
modifications  of  tridiagonal  Toeplitz  matrices  and  are  doubly  stochastic.  In¬ 
deed,  note  that  the  evolutions  converge  to  the  average  consensus  value,  as 
predicted  by  Corollary  1.67.  Note  that  convergence  times  obtained  for  Circn 
and  ATrid,^  are  consistent  with  the  upper  bound  predicted  by  Theorem  1.73. 


We  conclude  this  section  with  some  useful  bounds. 

Lemma  1.79  (Bounds  on  vector  norms).  Assume  x  £  Mn,  y  £  Mn_1, 
and  z  £  Mn_1  jointly  satisfy 


X  =  P+ 

'o' 

'o' 

,  X  =  P- 

y 

Then  |||x||2  <  ||y||2  <  (n  -  l)|]x||2  and  |||x||2  <  ||z||2  <  (n  -  l)||x||2. 

The  proof  of  this  result  is  based  on  spelling  out  the  coordinate  expressions 
for  x,  y,  and  z,  and  is  left  to  the  reader  as  Exercise  El. 28. 


1.7  NOTES 

Dynamical  systems  and  stability  theory 


Our  definition  of  state  machine  is  very  basic;  more  general  definitions  of 
state  machines  can  be  found  in  the  literature,  e.g.,  see  [Sipser,  2005],  but 
the  one  presented  in  this  chapter  are  enough  of  our  purposes. 

The  literature  on  dynamical  and  control  systems  is  vast.  The  main 
tool  we  use  in  later  chapters  is  the  LaSalle  Invariance  Principle,  obtained 
by  LaSalle  [1960]  and  discussed  in  [LaSalle,  1986];  see  also  the  earlier  works 
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by  [Barbasin  and  Krasovski!,  1952]  and  [Krasovski!,  1963]  for  related  ver¬ 
sions.  Example  relevant  references  include  modern  texts  on  dynamical  sys¬ 
tems  [Guckenheimer  and  Holmes,  1990],  linear  control  systems  [Chen,  1984], 
nonlinear  control  systems  [Khalil,  2002],  robust  control  [Dullerud  and  Paganini, 
2000],  and  discrete-event  systems  [Cassandras  and  Lafortune,  2007]. 


Graph  theory 

The  basic  definitions  of  graph  theory  are  standard  in  the  literature;  e.g., 
see  [Diestel,  2005,  Godsil  and  Royle,  2001,  Biggs,  1994],  The  discussion 
about  graph  algorithms  is  taken  from  [Cormen  et  ah,  2001],  which  also  con¬ 
tains  detailed  discussion  on  implementation  and  complexity  issues.  Regard¬ 
ing  Section  1.4. 4. 4,  standard  references  on  combinatorial  optimization  in¬ 
clude  [Korte  and  Vygen,  2005,  Vazirani,  2001]. 

In  Section  1.4.5,  all  statements  about  powers  of  the  adjacency  matrix 
are  standard  results  in  algebraic  graph  theory,  e.g.,  see  [Godsil  and  Royle, 
2001,  Biggs,  1994].  Lemma  1.24  is  a  recent  result  from  [Moreau,  2005, 
Lin  et  ah,  2005].  Proposition  1.32,  on  the  fact  that  a  weighted  digraph 
is  aperiodic  and  irreducible  if  and  only  if  its  adjacency  matrix  is  primi¬ 
tive,  is  related  to  standard  results  in  the  theory  of  Markov  chains,  e.g., 
see  [Seneta,  1981,  Meyer,  2001].  Our  proof  adopts  the  approach  in  [Lin, 
2005].  Laplacian  matrices  have  numerous  remarkable  properties;  two  el¬ 
egant  surveys  are  [Mohar,  1991,  Merris,  1994],  Theorem  1.34  character¬ 
izing  the  properties  of  the  Laplacian  matrix  contains  recent  results.  A 
proof  of  statement  (ii)  is  given  in  [Olfati-Saber  and  Murray,  2004];  in  our 
proof  we  follow  the  approach  in  [Francis,  2006].  Statement  (iii)  is  proved 
by  Lin  et  ah  [2005],  Francis  [2006];  the  following  equivalent  version  is  proved 
in  [Ren  and  Beard,  2005]:  a  weighted  digraph  G  contains  a  spanning  tree  if 
and  only  if  rank(L(rev(G)))  =  n  —  1.  Regarding  statement  (iv),  the  equiva¬ 
lence  between  (iv)a  and  (iv)b  is  proved  by  Olfati-Saber  and  Murray  [2004] 
and  the  equivalence  between  (iv)b  and  (iv)c  is  proved  by  Moreau  [2005]. 


Distributed  algorithms 

Our  discussion  of  distributed  algorithms  is  extremely  incomplete.  We  only 
presented  a  few  token  ideas  and  we  refer  to  the  textbooks  by  Lynch  [1997] 
and  Peleg  [2000]  for  detailed  treatments.  Let  us  mention  briefly  that  many 
more  efficient  algorithms  are  available  in  the  literature,  for  example,  the  GHS 
algorithm  [Gallager  et  ah,  1983]  for  minimum  spanning  tree  computation 
and  consensus  algorithms  with  communication  and  processors  faults;  much 

66 


“Distributed  Control  of  Robotic  Networks”  by  F.  Bullo,  J.  Cortes  and  S.  Martinez 
Copyright  ©  2006-2008.  Manuscript  under  contract.  This  version:  October  27,  2008 


DCRN  October  27,  2008 


attention  is  dedicated  to  fault  tolerance  in  asynchronous  systems  with  shared 
memory  and  in  asynchronous  network  systems. 


Linear  distributed  algorithms 

Distributed  linear  algorithms  and  in  particular,  averaging  iterations  that 
achieve  consensus  among  processors,  have  a  long  and  rich  history.  The 
richness  comes  from  the  vivid  analogies  with  physical  process  of  diffusion, 
with  Markov  chain  models,  and  with  the  sharp  theory  of  positive  matrices 
developed  by  Perron  and  Frobenius.  What  follows  is  a  necessarily  incom¬ 
plete  list.  An  early  reference  on  averaging  opinions  and  achieving  consensus 
is  [DeGroot,  1974].  An  early  reference  on  the  connection  between  averag¬ 
ing  algorithms,  the  products  of  stochastic  matrices  and  ergodicity  in  in¬ 
homogeneous  Markov  chains  is  [Chatterjee  and  Seneta,  1977]  -  the  history 
of  inhomogeneous  Markov  chains  being  a  classic  topic  since  the  early  20th 
century.  The  stochastic  setting  was  investigated  in  [Cogburn,  1984],  Load 
balancing  with  divisible  tasks  in  parallel  computers  is  discussed  in  [  'ybenko, 

1989].  A  comprehensive  theory  of  asynchronous  parallel  processors  imple¬ 
menting  distributed  gradient  methods  and  time-dependent  averaging  algo¬ 
rithms  is  developed  in  the  series  of  works  [Tsitsiklis,  1984,  Tsitsiklis  et  al., 

1986,  Bertsekas  and  Tsitsiklis,  1997].  Much  interest  for  averaging  algorithms 
arose  from  the  influential  work  on  flocking  by  .Jadbabaie  et  al.  [2003].  Sharp 
conditions  for  convergence  for  the  time-dependent  setting  were  obtained 
in  [Moreau,  2005].  Finally,  proper  attention  to  the  average  consensus  prob¬ 
lem  was  given  in  [Olfati-Saber  and  Murray,  2004]. 

Regarding  Theorem  1.60  characterizing  the  convergence  of  averaging  al¬ 
gorithms  defined  by  sequences  of  stochastic  matrices,  we  note  that  (1) 
the  PhD  thesis  [Tsitsiklis,  1984]  established  convergence  under  a  strong- 
connectivity  assumption,  (2)  a  sufficient  condition  was  independently  re¬ 
discovered  in  [Jadbabaie  et  al.,  2003]  adopting  a  result  from  [Wolfowitz, 

1963],  and  (3)  [Moreau,  2003,  2005]  obtained  the  necessary  and  sufficient 
condition  (for  uniform  convergence  in  non-degenerate  sequences)  involving 
the  existence  of  a  uniformly  globally  reachable  node.  The  work  in  [Moreau, 

2003,  2005]  is  an  early  reference  also  for  Theorem  1.62;  additional  related  re¬ 
sults  and  a  historical  discussion  appeared  in  [Blondel  et  al.,  2005,  Hendrickx, 

2008].  The  estimates  of  the  on  convergence  factor  given  in  Theorem  1.73  in 
Section  1.6.3  were  proved  by  Landau  and  Odlyzko  [1981].  Our  treatment  of 
in  Section  1.6.4  follows  [Martinez  et  al.,  2007a]. 

Among  the  numerous  recent  directions  of  research  on  consensus  and  aver¬ 
aging  we  mention:  continuous-time  consensus  algorithms  [Olfati-Saber  and  Murray, 

2004,  Moreau,  2004,  Lin  et  al.,  2004,  Ren  and  Beard,  2005,  Lin  et  al.,  2005, 
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2007c],  consensus  over  random  networks  [Hatano  and  Mesbahi,  2005,  Wu, 

2006,  Porfiri  and  Stilwell,  2007,  Tahbaz-Salehi  and  Jadbabaie,  2008,  Picci  and  Taylor, 

2007,  Fagnani  and  Zampieri,  2008,  Patterson  et  al.,  2007],  consensus  in  finite¬ 
time  [Cortes,  2006,  Sundaram  and  Hadjicostis,  2008],  consensus  algorithms 
for  general  functions  [Bauso  et  al.,  2006,  Cortes,  2008b,  Lorenz  and  Lorenz, 

2008,  Sundaram  and  Hadjicostis,  2008],  connections  with  the  heat  equa¬ 
tion  and  partial  difference  equation  [Ferrari-Trecate  et  ah,  2006],  spatially- 
decaying  interactions  [Cucker  and  Smale,  2007],  convergence  in  time-delayed 
and  asynchronous  settings  [Blondel  et  ah,  2005,  Angeli  and  Bliman,  2006, 

Fang  and  Antsaklis,  2008],  quantized  consensus  problems  [Savkin,  2004,  Kashyap  et  ah, 
2007,  Carli  et  ah,  2008a,  Zhu  and  Martinez,  2008b],  consensus  on  mani¬ 
folds  [Sarlette  and  Sepulchre,  2007,  Scardovi  et  ah,  2007,  Igarashi  et  ah,  2007], 
applications  to  distributed  signal  processing  [Spanos  et  ah,  2005,  Xiao  et  ah, 

2005,  Olfati-Saber  et  ah,  2006,  Zhu  and  Martinez,  2008a],  characterization 
of  the  convergence  rates  and  time  complexity  [Landau  and  Odlyzko,  1981, 

Olshevsky  and  Tsitsiklis,  2007,  Carli  et  ah,  2008b,  Cao  et  ah,  2008].  Nu¬ 
merous  interesting  results  are  reported  in  the  recent  PhD  theses  [Lin,  2005, 

Lorenz,  2007,  Cao,  2007,  Hendrickx,  2008,  Carli,  2008].  Finally,  we  point 
out  two  recent  surveys  [Olfati-Saber  et  ah,  2007,  Ren  et  ah,  2007]  and  the 
text  [Ren  and  Beard,  2008]. 

Synchronization  is  a  fascinating  topic  related  to  averaging  algorithms. 

A  very  early  reference  is  the  work  by  Huygens  [1673]  on  coupled  pen- 
dula.  The  synchronization  of  oscillators  in  dynamical  systems  has  received 
increasing  attention  and  key  references  include  [Wiener,  1958,  Ivuramoto, 

1975,  Winfree,  1980,  Kuramoto,  1984,  Strogatz,  2000,  Nijmeijer,  2001];  see 
also  the  widely  accessible  [Strogatz,  2003].  Under  all-to-all  interactions, 

Mirollo  and  Strogatz  [1990]  prove  synchronization  of  a  collection  of  “in¬ 
tegrate  and  fire”  biological  oscillators.  Recent  works  on  the  Kuramoto 
and  other  synchronized  oscillator  models  include  [.Jadbabaie  et  ah,  2004, 

Chopra  and  Spong,  2008,  Triplett  et  ah,  2006,  Papachristodoulou  and  Jadbabaie, 

2006,  Wang  and  Slotine,  2006]. 

1.8  PROOFS 

This  section  gathers  the  proofs  of  the  main  results  presented  in  the  chapter. 


1.8.1  Proof  of  Theorem  1.18 

Here  we  provide  the  proof  of  the  LaSalle  Invariance  Principle  for  set-valued 
discrete-time  dynamical  systems.  We  remark  that  Theorem  1.16  is  an  im¬ 
mediate  consequence  of  Theorem  1.18  and  that  Theorem  1.17  is  proved  in 
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a  similar  way  (we  refer  to  [Khalil,  2002]  for  details). 


Proof  of  Theorem  1.18.  Let  7  be  any  evolution  of  (X,Xq,T)  starting  from 
W.  Let  £2(7)  C  W  denote  the  w-limit  set5  of  the  sequence  7  =  {7(7  |  i  E 
Z>o}.  First,  let  us  prove  that  12(7)  is  weakly  positively  invariant.  Let 
z  E  f2(y).  Then  there  exists  a  subsequence  {' y(£m )  |  m  E  Z>o}  of  7  such 
that  lim  7 (£m)  =  z.  Consider  the  sequence  {' y(£m  +  1)  |  m  E  Z>o}. 

ra— >+oo 

Since  this  sequence  is  bounded,  it  has  a  convergent  subsequence.  For  ease 
of  notation,  we  use  the  same  notation  to  refer  to  it,  i.e. ,  there  exits  y  such 
that  lim  7 (£m  +  1)  =  y.  By  definition,  y  E  ^(7).  Moreover,  using  the 

m— >+00 

fact  that  T  is  closed,  we  deduce  that  y  E  T(z).  Therefore,  12(7)  is  weakly 
positively  invariant. 

Now,  consider  the  sequence  1/07  =  {1/(7 (£))  \  £  E  Z>o}.  Since  7  is 
bounded  and  V  is  non-increasing  along  T  on  W,  the  sequence  V  o  7  is 
decreasing  and  bounded  from  below,  and  therefore,  convergent.  Let  cER 
satisfy  lim  1/(7 (if)  =  c.  Next,  we  prove  that  the  value  of  V  on  12(7)  is 

£—>+00 

constant  and  equal  to  c.  Take  any  z  E  f2(y).  Accordingly,  there  exists  a 
subsequence  {' y(£m )  |  fn  E  Z>o}  such  that  lim  j(£m)  =  z.  Since  V  is 

m— >+00 

continuous,  lim  V(iy(£m))  =  V{z).  From  lim  V{pf(£))  =  c,  we  conclude 

m— >+OC  £—>-1-00 

that  V  ( z )  =  c. 

Finally,  f2(y)  being  weakly  positively  invariant  and  V  being  constant  on 
0(7)  imply  that 

0(7)  C  {z  E  X  |  By  E  T{z)  such  that  V(y)  =  V{z)}. 

Therefore,  we  conclude  that  lim  dist(7(L),  5  n  K_1(c))  =  0,  where  S  is 

i— >+00 

the  largest  weakly  positively  invariant  set  contained  in  {p  E  W  \  3p'  E 
T(p)  such  that  V(p')  =  V(p)}.  ■ 


1.8.2  Proofs  of  Lemmas  1.23  and  1.24 


Proof  of  Lemma  1.23.  The  first  statement  is  obvious.  Regarding  the  second 
statement,  we  prove  that  a  topologically  balanced  digraph  with  a  globally 
reachable  node  is  strongly  connected,  and  leave  the  proof  of  the  other  case 
to  the  reader.  We  reason  by  contradiction.  Assume  G  is  not  strongly  con¬ 
nected.  Let  Sc  h  be  the  set  of  all  nodes  of  G  that  are  globally  reachable. 


5The  aj-limit  set  of  a  sequence  7  =  {7 (£)  \  l  €  Z>q}  is  the  set  of  points  y  for  which  there  exists 
a  subsequence  {7 (^m)  |  tti  G  Z> q }  of  7  such  that  lim  7 (£m)  =  y- 

—  m — >+cxd 
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By  hypothesis,  5/0.  Since  G  is  not  strongly  connected,  we  have  S  C.  V. 
Note  that  any  outgoing  edge  with  origin  in  a  globally  reachable  node  auto¬ 
matically  makes  the  destination  a  globally  reachable  node  too.  This  implies 
that  there  cannot  be  any  outgoing  edges  from  a  node  in  5  to  a  node  in 
V\S.  Let  v  £  V\S  such  that  v  has  an  out-neighbor  in  5  (such  a  node  must 
exist,  since  otherwise  the  nodes  in  5  cannot  be  globally  reachable).  Since  by 
hypothesis  G  is  balanced,  there  must  exist  an  edge  of  the  form  (w,  v )  £  E. 
Clearly,  w  /  5,  since  otherwise  v  would  be  globally  reachable  too,  which 
is  a  contradiction.  Therefore,  w  £  V  \  5.  Again,  using  the  fact  that  G  is 
topologically  balanced,  there  must  exist  an  edge  of  the  form  (z,  w)  £  E.  As 
before,  z  £  V\S  (note  that  z  =  v  is  a  possibility).  Since  V\S  is  finite  and  so 
is  the  number  of  possible  edges  between  its  nodes,  applying  this  argument 
repeatedly,  we  find  that  there  exists  a  vertex  whose  out-degree  is  strictly 
larger  than  its  in-degree,  which  is  a  contradiction  with  the  fact  that  G  is 
topologically  balanced.  We  refer  to  [Cortes,  2008b]  for  the  proof  that  G  is 
Eulerian.  ■ 


Proof  of  Lemma  1.24 ■  (i)  =>•  (ii)  Assume  that  i  £  V  is  the  root  of 

the  spanning  tree  and  take  an  arbitrary  pair  of  nonempty,  disjoint  sub¬ 
sets  U\,  U2  C  V.  If  i  £  U\,  then  there  must  exist  a  path  from  i  £  U\  to  a 
node  in  Ui-  Therefore,  U2  must  have  an  in- neighbor.  Analogously,  if  i  £  C/2, 
then  U\  must  have  an  in-neighbor.  Finally,  it  is  possible  that  i  (f  C/1UC/2. 
In  this  case,  there  exists  paths  from  i  to  both  U±  and  1/2,  that  is,  both  sets 
have  in-neighbors. 

(ii)  ==r  (i)  This  is  proved  by  finding  a  node  from  which  there  exists  a 
path  to  all  others.  We  do  this  in  an  algorithmic  manner  using  induction.  At 
each  induction  step  k,  except  the  last  one,  four  sets  of  nodes  are  considered 
U\{k)  C  W\{k)  C  V,  L2 (fc)  C  W2(k)  C  V  with  the  following  properties: 

(a)  The  sets  W\(k)  and  W2(k)  are  disjoint;  and 

(b)  from  each  node  of  Us(k)  there  exists  a  path  to  each  other  node  in 
Ws(k)\Ua(k),  a  £  {1,2}. 

Induction  Step  k=l:  Set  U\  =  W\  =  {/}  and  U2  =  W2  =  {*2})  where  i\,  i2 
are  two  arbitrary  different  nodes  of  the  graph  that  satisfy  the  requirements 
(a)  and  (b). 

Induction  Step  k  >  1:  Suppose  that  for  k  —  1  we  found  sets  U\(k  —  1)  C 
W\{k  —  1)  and  U2 (k  —  1)  C  W(k  —  1)  as  in  (a)  and  (b).  Since  U\  (k  —  1) 
and  C2 (^  —  1)  are  disjoint,  then  there  exists  either  an  edge  (ik,j  1)  with 
ji  G  U\(k  —  1),  ik  £  V  \  U\{k  -  1),  or  an  edge  (4,  j2)  with  j2  £  U2(k  -  1) 
and  ik  £  V  \  U2(k  —  1).  Suppose  that  an  edge  (4,72)  exists  (the  case  of  a 
edge  (4,4)  can  be  treated  in  a  similar  way).  Only  four  cases  are  possible. 
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(A)  If  ik  E  W\(k  —  1)  and  W\(k  —  1)  U  W2(k  —  1)  =  V,  then  we  can  ter¬ 
minate  the  algorithm  and  conclude  that  from  any  node  i  E  U\(k  —  1) 
there  exists  a  path  to  all  other  nodes  in  the  graph  and  thus  there  is  a 
spanning  tree. 

(B)  If  ik  E  W\(k  —  1)  and  Wx(k  -  1)  U  W2(k  -l)^V,  then  set: 

U1(k)  =  U1(k-l), 

W\  (k)  =  Wi(k-l)UW2(k-2), 

U2(k)  =  W2{k)  =  {ik}, 

where  ik  is  an  arbitrary  node  which  does  not  belong  to  W\(k—  1)  U  W2(k— 

!)• 

(C)  If  ik  i  Wx{k  -  1)  UW2{k  -  1),  then  set 

Ui{k)  =  U\(k  —  1), 

W1(k)  =  W1(k-l), 

U2(k )  =  {ik}, 

W2(k)  =  W2(k-l)U{ik}. 

(D)  If  ik  E  W2(k  -  1)  \  U2{k  -  1)  then 

U\(k)  =  U\(k  —  1), 

W1(k)  =  W1(k-l), 

U2(k)  =  U2(k  —  1)  U{ifc}, 

W2(k)  =  W2{k-  1). 

The  algorithm  terminates  in  a  finite  number  of  induction  steps  because  at 
each  step,  except  when  finally  case  (A)  holds  true,  either  the  number  of 
nodes  in  W\  U  W2  increases,  or  the  number  of  nodes  in  W\  U  W2  remains 
constant  and  the  number  of  nodes  in  U\\J  U2  increases.  ■ 


1.8.3  Proofs  of  Propositions  1.30  and  1.32 

Proof  of  Proposition  1.30.  (ii)  =>  (i)  We  aim  to  show  that  there  exist 

directed  paths  from  any  node  to  any  other  node.  Fix  i  E  {1, . . . ,  n}  and  let 
Ri  C  {1, . . . ,  n}  be  the  set  of  nodes  that  belong  to  directed  paths  originating 
from  node  i.  Denote  the  unreachable  nodes  by  Ui  =  {1, . . .  ,n}  \  Ri.  We 
argue  that  Ui  cannot  contain  any  element,  because  if  it  does,  then  Ri  U  Ui 
is  a  nontrivial  partition  of  the  index  set  {1, . . . ,  n}  and  irreducibility  implies 
the  existence  of  a  non-zero  entry  ajk  with  j  E  Ri  and  k  E  Ui.  Therefore, 
Ut  =  0,  and  all  nodes  are  reachable  from  i.  The  converse  statement  (i)  =>- 
(ii)  is  proved  similarly. 
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(i)  =>  (iii)  If  G  is  strongly  connected,  then  there  exists  a  directed  path 
of  length  k  <  n  —  1  connecting  any  node  i  to  any  other  node  j.  Hence, 
by  Lemma  1.29(h),  the  entry  ( Ak)ij  is  strictly  positive.  This  immediately 
implies  the  statement  (iii).  The  converse  statement  (iii)  ==>  (i)  is  proved 
similarly.  ■ 


Next  we  present  a  useful  number  theory  result.  This  states  that  relatively 
co-prime  numbers  generate  all  sufficiently  large  natural  numbers. 

Lemma  1.80  (Natural  number  combination).  Let  a\, . . . ,  av  £  N  have 
greatest  common  divisor  1.  There  exists  k  £  N  such  that  every  number 
m  >  k  can  be  written  as 


771  —  OL\CL\  -J-  •  •  •  -|- 

for  appropriate  numbers  a\ , ,ax  £  N. 


Proof.  Assume  that  a\  <  ■  ■  ■  <  ajv  without  loss  of  generality.  From  the 
generalized  Bezout  identity  we  know  that,  for  any  numbers  a\, ... ,  a]y  with 
greatest  common  divisor  1 ,  there  exist  integers  71 , . . . ,  7 n  £  Z  such  that 

1  =  7i“iH - l~7ivffiv ■  (1.8.1) 

Pick  k  =  I71  |af  +  •  •  •  +  |7iv|a^r  £  N.  Every  number  m  >  k  can  be  written  as 

m  =  k  +  TUqtntOl  T  ^rmndr : 

for  appropriate  numbers  mqtnt  >  0  and  1  <  mrmn dr  <  a\ ■  Using  the  defini¬ 
tion  of  k  and  equation  (1.8.1),  we  write 

m  =  (|7i|°i  H - 1-  \lN\a2N)  +  mqtntai  +  mrmndr(7iai  -| - b  7Aradv) 

=  J^lqtntOl  (|7l|fll  +  W-rmndr7l)®l  T  '  '  '  T  ( 1 7JV  |  ®  JV  T  W-rmndr7Af )®  N  ■ 

The  proof  is  now  complete  by  noting  that  each  integer  number  (I71  |o-i  + 
mrmndr7i),  ■  •  ■ ,  (|7iv|ajv  +  "irmndr7Ar)  is  strictly  positive  because  mrmndr  < 
ai  <  ■  ■  •  <  aj\[.  ■ 


Proof  of  Proposition  1.32.  (i)  =b  (ii)  Pick  any  i.  We  claim  there  exists  a 
number  k[i)  with  the  property  that  for  all  m  >  k(i)  we  have  that  (Am) a  is 
positive,  that  is,  there  exists  a  directed  path  from  i  to  i  of  length  m  for  all  m 
larger  than  a  number  k[i).  To  show  this  claim,  let  {ci, . . . ,  cjv}  be  the  set  of 
the  cycles  of  G  and  let  {^i, . . . ,  be  their  lengths.  Because  G  is  aperiodic, 
Lemma  1.80  implies  the  existence  of  a  number  h(£\, . . .  ,£n)  such  that  any 
number  larger  than  h(£  1, . . .  ,£n)  is  a  linear  combination  of  £1, . . . , tjv  with 
natural  numbers  as  coefficients.  Because  G  is  strongly  connected,  there 
exists  a  path  7  of  arbitrary  length  T(z)  that  starts  at  i,  contains  a  vertex 
of  each  of  the  cycles  c±, . . . ,  cjy,  and  terminates  at  i.  Now,  we  claim  that 
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k(i)  =  r(i)+/i(^i, . . . ,  In)  has  the  desired  property.  Indeed,  pick  any  number 
rri  >  k(i )  and  write  it  as  k  =  T{i)+j3\l\  +  -  ■  ■ +Pn@n  for  appropriate  numbers 
Pi, . . . ,  Pn  £  N.  A  directed  path  from  i  to  i  of  length  m  is  constructed  by 
attaching  to  the  path  7  the  following  cycles:  Pi  times  the  cycle  ci,  P2  times 
the  cycle  C2,  . . . ,  Pn  times  the  cycle  cn ■  Finally,  having  proved  the  existence 
of  k(i)  with  the  desired  property,  let  K  be  the  maximum  k(i)  over  all  nodes 
i,  and  recall  that  diam(G')  is  the  maximum  pairwise  distance  between  nodes. 
Clearly,  Aai  is  positive  for  all  M  >  K  +  diam(G'). 

(ii)  =>  (i)  From  Lemma  1.29  we  know  that  Ak  >  0  means  that  there 
are  paths  from  every  node  to  every  other  node  of  length  k.  Hence,  the 
digraph  G  is  strongly  connected.  Next,  we  prove  aperiodicity.  Because  G  is 
strongly  connected,  each  node  of  G  has  at  least  one  outgoing  edge,  i.e. ,  for 
all  i,  there  exists  at  least  one  index  j  such  that  a^-  >  0.  This  fact  implies 
that  the  matrix  Ak+1  =  AAk  is  positive  via  the  following  simple  calculation: 
(■ Ak+1)u  =  YJh=i  aih(Ak)hl  >  aij(Ak)ji  >  0.  In  summary,  we  have  shown 
that,  if  Ak  is  positive  for  some  k,  then  Am  is  positive  for  all  subsequent 
m  >  k.  Therefore,  there  are  cycles  in  G  of  any  length  greater  than  or  equal 
to  k,  which  means  that  G  is  aperiodic.  ■ 


1.8.4  Proof  of  Theorem  1.34 

Proof.  We  begin  with  statement  (i).  Let  Uj,  for  i. j  E  {l,...,n},  be  the 
entries  of  L(G).  Note  that  la  =  Y^=i  —  0  and  hj  =  ~aij  <  0  for 

i  ^  j.  By  the  Gersgorin  Disks  Theorem  1.3,  we  know  that  each  eigenvalue 
of  L{G )  belongs  to  at  least  one  of  the  disks 

n 

^zEC|  \\z  ^zi||c—  ^  "  |  k,j  |}  —  ^  z  E  C  |  ||  z  la  1 1  c  ^  la  ^ . 

j=l,j^=i 

These  disks  contain  the  origin  0n  and  complex  numbers  with  positive  real 
part.  This  concludes  the  proof  of  statement  (i). 

Regarding  statement  (ii),  note  that  DOVLt(G)  is  invertible  because  G  is 
strongly  connected.  Define  the  two  matrices  A  =  Dout(G)_1H(G)  and  L  = 
Dout(G)~1L(G)  and  note  that  they  satisfy  L  =  In  —  A.  Since  DOVLt(G) 
is  diagonal,  the  matrices  A(G)  and  A  have  the  same  pattern  of  zeros  and 
positive  entries.  This  observation  and  the  assumption  that  G  is  strongly 
connected  imply  that  A  is  nonnegative  and  irreducible.  By  the  Perron- 
Frobenius  Theorem  1.9,  the  spectral  radius  p(A)  is  a  simple  eigenvalue. 
Furthermore,  one  can  verify  that  A  is  row-stochastic  (see  Lemma  1.28), 
and  therefore,  its  spectral  radius  is  1  (see  Exercise  El. 4).  In  summary,  we 
conclude  that  1  is  a  simple  eigenvalue  of  A,  that  0  is  a  simple  eigenvalue  of 
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L,  that  L  has  rank  n  —  1,  and  that  L(G )  has  rank  n  —  1. 

Regarding  statement  (iii),  we  first  prove  that  rank(L(G))  =  n  —  1  implies 
the  existence  of  a  globally  reachable  vertex.  By  contradiction,  let  G  con¬ 
tain  no  globally  reachable  vertex.  Then,  by  Lemma  1.24,  there  exist  two 
nonempty  disjoint  subsets  Ui,  U2  C  V(G)  without  any  out-neighbor.  After 
a  permutation  of  the  vertices,  the  adjacency  matrix  can  be  partitioned  into 
the  blocks 

An  0  0 

A(G )  =  0  A22  0 

_A;i  1  A32  A33_ 

Here,  A\2  and  Ai 3  vanish  because  U\  does  not  have  any  out-neighbor,  and 
A21  and  A23  vanish  because  U2  does  not  have  any  out-neighbor.  Note  that 
Du  ~  An  and  D22  —  A22  are  the  Laplacian  matrices  of  the  graphs  defined 
by  restricting  G  to  the  vertices  in  Ui  and  in  U2,  respectively.  Therefore,  the 
eigenvalue  0  has  geometric  multiplicity  at  least  2  for  the  matrix  Dout(G )  — 
A(G).  This  contradicts  the  assumption  that  rank(L(G))  =  n  —  1. 


Next,  still  regarding  statement  (iii),  we  prove  that  the  existence  of  a  glob¬ 
ally  reachable  vertex  implies  rank(L(G))  =  n  —  1.  Without  loss  of  generality, 
we  assume  that  G  contains  self-loops  at  each  node  (so  that  D0 ut  is  invert¬ 
ible).  Let  R  be  the  set  of  globally  reachable  vertices;  let  r  £  {1, . . . ,  n}  be  its 
cardinality.  If  r  =  n,  then  the  graph  is  strongly  connected  and  statement  (ii) 
implies  rank(L(G))  =  n  —  1.  Assume  therefore  r  <  n.  Renumber  the  ver¬ 
tices  so  that  R  is  the  set  of  the  first  r  vertices.  After  this  permutation,  the 
adjacency  matrix  and  Laplacian  matrix  can  be  partitioned  into  the  blocks 


MG) 


An  0 

A21  A22 


and  L(G ) 


Ln  0 
L21  L22 


Here,  A12  £  Rrx(n-r)  vanishes  because  there  can  be  no  out-neighbor  of  R 
otherwise  that  out-neighbor  would  be  a  globally  reachable  vertex  in  V  \  R. 
Note  that  the  rank  of  Ln  £  Wxr  is  exactly  r  —  1  since  the  digraph  associated 
to  An  is  strongly  connected.  To  complete  the  proof  it  suffices  to  show  that 
the  rank  of  L22  £  R(n_r)x  is  full.  Note  that  the  same  block  partition 
applies  to  the  matrices  A  =  D~^tA  and  L  =  D~^tL  considered  in  the  proof 
of  statement  (ii)  above.  With  this  block  decomposition,  we  compute 

in-l  =  f  A^1  0  1 

|_A2i(ra  —  1)  M21]  ’ 

for  some  matrix  A2i(n  —  1)  that  depends  upon  An,  A21  and  A22-  Because 
a  globally  reachable  node  in  G  is  globally  reachable  also  in  the  digraph 
associated  to  A,  Proposition  1.30(v)  implies  that  A2i(n  —  1)  is  positive. 
This  fact,  combined  with  the  fact  that  A  and  hence  An_1  are  row-stochastic, 
implies  that  A^1  has  maximal  row  sum  (that  is,  oo-induced  norm)  strictly 
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less  than  1.  Hence,  the  spectral  radius  of  A and  of  A 22  are  strictly 
less  than  1.  Since  A 22  has  spectral  radius  strictly  less  than  1,  the  matrix 
L22  =  In-r  ~  A22,  and  in  turn  the  matrix  L 22,  have  full  rank. 

Regarding  statement  (iv),  the  equivalence  between  (iv)a  and  (iv)b  is 
proved  as  follows.  Because  YHj= 1  hj  =  d0ut(vi)  —  d\a{vi)  for  all  i  E  {1, . . . ,  n}, 
it  follows  that  1  JlL(G)  =  0^  if  and  only  if  Dout(G)  =  Dm(G).  Next,  we 
prove  that  (iv)b  implies  (iv)c.  Suppose  that  L{G)1  ln  =  0^  and  consider 
the  system  7 (t)  =  —L(G)j(t),  7(0)  =  xo,  together  with  the  positive  definite 
function  V  :  Mn  — >  R  defined  by  V (x)  =  xTx.  We  compute  the  Lie  derivative 
of  the  function  V  along  the  vector  field  x  1— »  —L(G)x  as  V(x)  =  —2 xT  L(G)x. 
Note  that  V(x)  <  0,  for  all  x  E  Mn,  is  equivalent  to  L(G)  +  L(G)T  >  0. 
Because  l^L(G)  =  0^  and  L(G)ln  =  0n,  it  is  immediate  to  establish  that 
exp (-L(G)t),  f  6  R,  is  a  doubly  stochastic  matrix.  From  Theorem  1.2  we 
know  that,  if  we  let  {Pa}  be  the  set  of  n  x  n  permutation  matrices,  then 
there  exist  time-dependent  convex  combination  coefficients  ^aAa(t)  =  1, 
Xa(t)  >  0,  such  that  exp(— L(G)t)  =  XQ(t)Pa.  By  the  convexity  of  V 
and  its  invariance  under  coordinate  permutations,  for  any  x  £  Mn,  we  have 

V(exp(-L(G)t)x)  =  V{^2  A a{t)Pax) 

a 

<  ^2  A a(t)V (. PaX )  =  Y2  a a(t)V(x)  =  V(x). 

a  a 

In  other  words,  R(exp(— L(G)t)x)  <  V[x)  for  all  x  £  Rn,  which  implies 
V(x)  <  0,  for  all  x  £  Mn.  Finally,  we  prove  that  (iv)c  implies  (iv)b.  By 
assumption,  — xT(L(G )  +  L(G)T)x  =  —2 xT L(G)x  <  0  for  all  x  £  Mn.  In 
particular,  for  any  small  e  >  0  and  x  =  ln  —  eL(G)Tln, 

-(In  -  ellL(G))L(G)(ln  -  eL(G)T ln)  =  e\\L(G)Tln\\22  +  0{e2)  <  0, 
which  is  possible  only  if  L(G)'1  ln  =  0^.  ■ 


1.8.5  Proofs  of  Theorem  1.60  and  Proposition  1.64 

In  this  section  we  prove  Theorem  1.60.  The  exposition  follows  along  the  main 
lines  of  the  original  proof  by  Moreau  [2005] ,  with  the  variation  of  using  the 
LaSalle  Invariance  Principle  for  set-valued  dynamical  systems,  presented  as 
Theorem  1.18.  We  begin  with  some  preliminary  results. 

Lemma  1.81  (Union  of  graphs  and  sums  of  adjacency  matrices). 

Let  G\, ...  ,Gs  be  unweighted  digraphs  with  common  node  set  {1, . . .  ,n}  and 
adjacency  matrices  A\, . . . ,  A&.  The  unweighted  digraph 

G\\J-  ■  -\JG$  =  ({l,...,n},E{Al)U---UE(As)) 


75 


“Distributed  Control  of  Robotic  Networks”  by  F.  Bullo,  J.  Cortes  and  S.  Martinez 
Copyright  ©  2006-2008.  Manuscript  under  contract.  This  version:  October  27,  2008 


DCRN  October  27,  2008 


is  equal  to  the  unweighted  digraph  associated  to  the  nonnegative  matrix 
Sfceji  5}Ak,  that  is,  the  unweighted  digraph  ({1, ...  ,n},  E(A\  +  - •■  + As)). 


Proof.  If  (i,  j)  E  U ke{i,...,S}  E(Gk),  then  there  exists  ko  E  <5}  such 

that  (i,j)  E  E(Gk0).  Denoting  the  entries  of  the  matrix  A &  by  aij(k),  this 
implies  that  a^(fco)  >  0,  that  a^(  1)  +  •  •  •  +  aij{S)  >  0,  and  that  (i,j)  is  an 
edge  in  E(A\  +  •  •  ■  +  A$).  The  converse  statement  is  easily  proved  with  an 
analogous  reasoning.  ■ 


In  what  follows,  for  a  E  ]0, 1],  let  F(a)  denote  the  set  of  n  x  n  stochastic 
matrices  that  are  non-degenerate  with  respect  to  a.  Given  a  E  ]0, 1]  and 
5  E  N,  define  the  sets  Fg(a)  C  WlXn  by 

F$(a)  =  {.F  E  F(a5)  \  3  F\, . . . ,  Fg  E  F(a)  such  that  F  =  Fg  ■  ■  ■  F\ 

and  G(Fi)  U  •  •  •  U  G(Fg)  contains  a  globally  reachable  node}, 

or,  equivalently  by  Proposition  1.30, 

Fg(a)  =  {.F  E  F(ofi)  \  3Fi,...,Fg  E  F(a)  such  that  F  =  F$  ...  T\ 

and  a  column  of  (Ti  +  •  •  •  +  Fs)n  has  positive  entries}. 

Lemma  1.82  (Compact  sets  of  stochastic  matrices).  Fora  E  ]0,1], 
the  sets  F(a)  and  Fg(a),  5  E  N,  are  compact. 


Proof.  All  sets  are  clearly  bounded.  We  invite  the  reader  in  Exercise  El. 23 
to  prove  that  F{a)  is  closed.  Let  us  prove  now  that  Fg(a)  is  closed.  Consider 
a  matrix  sequence  {F(k)  \  k  E  N}  C  Fs(a)  convergent  to  some  matrix  F. 
Because  F(as)  is  closed,  we  establish  that  F  E  F(as).  Because  each  matrix 
F(k)  belongs  to  Fg(a),  there  exist  matrices  F\(k), . . .  ,F$(k )  E  F(a)  such 
that  F(k)  =  Fs(k)  ■  ■  •  Fi(k).  We  claim  that  there  exists  a  sequence  ki  E  N, 
for  l  E  N,  such  that,  for  all  s  E  (1, . . . ,  <5},  the  matrix  sequences  Fs(ki),  l  E  N, 
are  convergent.  (To  see  this,  note  that  F\  (k)  takes  value  in  a  compact  set, 
hence  it  must  have  a  convergent  subsequence.  Restrict  F'2(k)  to  the  instants 
of  time  in  the  convergent  subsequence  for  F\(k)  and  observe  that  it  takes 
value  in  a  compact  set,  etc.)  Therefore,  there  exist  matrices  Fs,  to  which  the 
matrix  sequences  Fs(ki),  l  E  N,  converge.  Taking  the  limit  as  l  — >  +oo  in  the 
equality  F(ki)  =  Fg(ki)  ■  ■  ■  F\  (k/),  we  establish  that  F  =  F$  ■  ■  ■  F\.  Finally, 
it  remains  to  show  that  a  column  of  B  :=  (F±  +  •  ■  ■  +  Fg)n  has  positive 

entries.  For  k  E  N,  define  B{k)  =  (Fi(k)-\ - \-Fs(k))n.  Clearly,  B{k)  — >  B 

as  k  — >■  Too.  By  the  definition  of  the  sequence  F(k),  each  B(k)  =  ( bij(k )) 
has  the  property  that  there  exists  jk  E  (1, . . .  ,n}  such  that  bijk(k)  >  0  for 
all  i  E  (1, . . . ,  n}.  Since  {1, . . . ,  n}  is  a  finite  set,  there  exists  jo  E  {1, . . . ,  n} 
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that  satisfies  this  property  for  an  infinite  subsequence  of  matrices  B(ki), 
l  £  N.  With  some  straightforward  bookkeeping,  we  write: 

S  n  n 

E  ■  E  (Fa,(kl))ihl  ■  ■  •  {Fan{h))hn_do. 

ai,...,an=l  h1=l  hn_i=l 

Note  that,  because  Fs(k)  £  F(a),  for  s  £  {1,...,5},  each  nonzero  entry 
Fs(k)  is  lower  bounded  by  a  >  0.  Furthermore,  each  entry  ( B(ki))ij0  is  the 
sum  of  nonnegative  terms,  each  of  which  is  the  product  of  n  factors,  each 
of  which  is  lower  bounded  by  a.  Hence,  because  ( B(ki))ij0  is  positive,  it 
is  also  lower  bounded  by  an.  Since  lim  B(k{)  =  B,  by  the  compactness 

Z— H |-oo 

of  [an,  1]  U{0},  it  must  be  that  B  =  ( bij )  satisfies  bji0  >  an  >  0  for  all 
j£{l,...,n}.  In  particular  this  implies  that  F  £  Fs(a)  and  then  F/j(a)  is 
closed.  ■ 


Finally,  we  are  able  to  prove  the  equivalences  in  Theorem  1.60. 


Proof  of  Theorem  1.60.  First,  we  prove  that  (i)  implies  (ii).  Suppose  that 
for  all  5  £  N,  there  exists  some  £q  £  N  such  that  the  digraph  with  edges 
Use[4A+<51  F(F(s))  does  not  contain  a  globally  reachable  node.  By  Lemma  1.24 
there  must  exist  a  set  of  nodes  U\,U2  C  { 1 , . . . ,  n}  such  that  there  are  no 
out-going  edges  with  i\  £  Ui,  i\  £  {l,...,n}  \  U\  or  (*2,^2),  with 

j2  £  U2,  12  £  {1,  ■  •  • ,  n}  \  U2-  Take  any  values  a,  b  £  R,  a  /  b,  and  consider 
the  initial  condition: 


Wi(£o) 


o,  i  £  U±, 

<  b,  i  £  U2, 

c  £  co (a,  b),  i  £  {1, . . . ,  n}  \  (XJ\  U  L^)- 


Because  of  the  properties  of  U\  and  U2,  for  all  5  £  N,  we  must  have 


Wj  (£q  +5+1) 


a,  j  £  Ui, 

<6,  j  £  U2, 

c  £  co  (a,  6),  j  £  {1, . .  -  ,n}\  {Ux  U  U2). 


Because  5  can  be  chose  arbitrarily  large,  one  can  easily  construct  a  con¬ 
tradiction  with  the  fact  that  diag(Mn)  is  supposed  to  be  uniformly  globally 
attractive. 


Next,  we  show  that  (ii)  implies  (i).  Let  a  £  ]0, 1]  to  be  the  scalar  with 
respect  to  which  the  sequence  is  non-degenerate.  Consider  the  set-valued 
discrete-time  dynamical  system  (Mn,  Ta<s),  with  evolution  map  Ta : 
Mn  Mn  defined  by 

Fa,s(w)  =  {Fw  |  F  £  Fs(a)}. 
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Because  of  this  definition,  any  trajectory  w  :  Z>o  — ►  Rn  of  the  averaging 
algorithm  (1.6.2)  satisfies 

w((k  +  1)<5)  G  Tats(w(k5)),  k  G  Z>0. 

Next,  we  intend  to  use  the  LaSalle  Invariance  Principle  for  set-valued  dis¬ 
crete  systems,  presented  as  Theorem  1.18,  to  prove  that  lim  dist(u;(M),  diag(Rn)) 

£— >+oo 

0.  This  will  then  imply,  by  Lemma  1.21,  the  uniform  attractivity  statement 
in  theorem.  In  the  following  we  check  the  conditions  of  the  theorem. 

Closedness  of  the  set-valued  dynamical  system.  Consider  a  pair  of  vector 
sequences  {xk  \  k  G  N}  and  {yk  \  k  G  N}  in  Rn  such  that  linifc^+ooX/c  = 
x,  limk^+ooVk  =  V  and  yk  G  Tat$(xk),  for  all  k  G  N.  We  need  to  show 
that  y  G  Ta^(x).  By  definition  of  Ta and  because  yk  G  Ta^ s(xk),  there 
exists  a  sequence  {F(k)  \  k  G  N}  C  F$(a)  such  that  F(k)xk  =  yk,  for  all 
k  G  N.  Furthermore,  since  F$(a)  is  compact  by  Lemma  1.82,  there  exists 
a  subsequence  {F(ki)  \  l  G  N}  that  is  convergent  to  some  F  G  F$(a).  The 
desired  conclusion  follows  from 

y  =  lim  yke  =  lim  F{kt)xkl  =  Fx. 

Z— >+oo  l — >-\-oo 

Non-increasing  Lyapunov  function.  Define  the  function  V  :  Mn  — >  R>o  by 
V  (x)  =  max  Xi  —  min  Xi . 

ie{l,...,re}  ie{l,...,n} 

Note  that  V  is  continuous.  Pick  any  x  G  Mn  and  any  stochastic  matrix 
F  G  Fs{a).  Recall  that  Halloo  =  maxjg^  |xj|,  and  that  HFHoo  =  1. 
Therefore,  by  definition  of  induced  norm,  IjFxHoo  <  H^Hoo-  Similarly,  in 
components, 

(Fx)i=  fijxj  —  (  fa)  n\Xk' 

j£{l....,n}  je{l,...,n} 

that  implies  minig{li  n}(Fx)j  >  minfegr1)  n\  Xk-  Therefore,  we  have  that 
V(Fx)  <  V(x)  for  all  x  G  Mn  and  F  G  Fg(a).  In  other  words,  the  function 
V  is  non-increasing  along  Ta  $  in  Mn. 

Boundedness.  It  is  immediate  to  see  that,  since  ||Fa:||oo  £  IWloo  for  all 
stochastic  matrices  F  and  vectors  x,  the  trajectory  k  >— *•  w(k8 )  is  bounded. 

Invariant  set.  By  Theorem  1.18,  any  trajectory  of  Ta.s,  and  hence  also  the 
trajectory  w  :  Z>o  — >  Mn  of  the  averaging  algorithm  (1.6.2),  will  converge 
to  the  largest  weakly  positively  invariant  set  contained  in  a  level  set  of  the 
Lyapunov  function  V  and  in  a  set  where  the  Lyapunov  function  does  not 
decrease  along  T.  In  the  following  we  determine  that  this  set  must  be 
contained  in  diag(Rn). 
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For  k  £  N  fixed,  assume  w(k5)  satisfies  V(w(k5 ))  >  0.  Given  the  averag¬ 
ing  algorithm  (1.6.2)  defined  by  the  sequence  {F(l)  \  l  £  Z>o}  C  F(a), 
define  F\(k)  =  F{k  +  1),  ...,  Fg(k)  =  F(&;  +  <5).  Additionally,  define 
F(k)  =  •  •  ■  F\(k)  and  note  that  F(k)  £  Fs{a),  by  construction.  With 

this  notation,  note  that  w(k5  +  s)  =  Fs{k )  •  •  •  Fi(k)w(k8)  for  s  £  {1, . . . ,  <5}. 
Define  wm  =  maxjgjq ^n\  Wi(k8)  and  wm  =  rriin^n  ^j  Wi{k5)\  by  hypoth¬ 
esis  we  know  wm  >  Define  Um  =  {i  £  {1,  •  ■  • ,  n}  \  w{k5 )  =  wm}  and 
Um  =  {i  £  {1,  ■  •  • , n}  |  Wj(k5 )  =  wm };  by  hypothesis  we  know  UM^Um  =  0. 
Now,  we  are  ready  to  use  the  property  (ii)  in  the  theorem  statement.  Since 
({1, . . . ,  n},  Use{i  .. E(Fs(k ))  contains  a  globally  reachable  node  and  since 
Um  and  Um  are  nonempty  and  disjoint,  then  Lemma  1.24  implies  that  there 
exists  either 


•  (an  out-neighbor  of  Um)  an  edge  (zm,  Jm)  £  E(Fs(kS ))  with  %m  £  Um , 
3m  £  {l,...,n}\  UM,  and  s  £  {1,...,<5};  or 


•  (an  out-neighbor  of  Um )  an  edge  (im,jm)  £  E(Fs(k5 ))  with  zm  £  [7m, 
jm  £  {1, . . . , n}  \  t/m,  and  s  £  {1, ...  ,5}. 


Without  loss  of  generality,  suppose  that  an  edge  (im,3m)  exists  and  let 
so  £  {1,  •  •  • ,  5}  be  the  first  time  index  for  which  this  happens.  We  have  that 


•  for  every  s  £  {1, . . . ,  so  —  1},  there  does  not  exist  any  edge  (i,  h)  with 
i  £  Um  and  h  ^  Um,  and,  thus,  for  all  i  £  Um', 


Wi{k5  +  1)  =  y^XFi{k))jjWj(k8)  =  ^  (Fi(k))ihwh(k5) 
j= i  h&UM 

=  (  Y]  (Fi(k))ih(k)^wM  =  wM- 

h(zU M 

The  same  argument  can  be  repeated  for  F^ik), . . . ,  Fs(k),  so  that 
Wi(k5  +  s)  =  wm  for  all  i  £  Um', 


•  if  i  £  Um  at  time  kS,  then  Wi(k5  +  s)  <  wm  for  all  s  £  {1, . . . ,  so  —  1}. 
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To  see  this,  we  compute 

n 

Wi(k5  +  1)  =  ^2(Fi(k))ijWj(k8) 

3= 1 

n 

=  (Fi(k))uWi(k8)  +  ^  ( Fi(k))ijWj(k5 ) 

n 

<  (Fi(k))iiWi(kS)  +  (^  ^2  (Ti (k))ij^wM 

j=l,j=£i 

<  awi(kd )  +  (1  —  a)wM  <  wm, 

where  we  used  the  assumption  of  non-degeneracy  with  parameter  a  6 
]a,  1].  The  same  argument  can  be  repeated  for  the  subsequent  multi¬ 
plications  by  the  matrices  F2(k), ... ,  Fs(k). 


We  finally  reach  time  so  and  compute 

n 

wiM{k5  +  s0)  =  '22(FSo {k))iMjwj (k5  +  s0  -  1) 

3= 1 

n 

=  (■ Fs0{k))iMjMwjM{k8  +  s0  -  1)  +  {Fs0{k))iMjWj{k5  +  s0  -  1) 

3=1,3¥=3m 

n 

<  (FSo(k))iMjMwM  +  (Fso(k))iMjWj(kd  +  so-l)<wM- 

J=b  j/jM 

This  implies  that  WiM((k  +  1)5)  <  wm  so  that  does  not  belong  to  Um  at 
time  ( k  +  1)5.  That  is,  the  cardinality  of  Um  decreases  at  least  by  one  after 
(l’+l)(5.  Since  {1, . . . ,  n}  is  finite,  after  repeating  this  argument  at  most  n— 1 
times,  we  have  that  either  Um  becomes  empty  at  time  (k  +  n  —  1)5.  (Here 
we  are  assuming  that  the  out-neighbor  always  exists  for  Um]  an  analogous 
argument  can  be  made  for  the  general  case.)  This  is  enough  to  guarantee 
that  V(w((k  +  n)5 ))  <  wm  ~  =  V(w(k6)).  This  is  what  we  need  to 

conclude  that  lim  dist(?u(M), diag(M))  =  0.  In  summary,  this  concludes 

/c— >+oo 

the  proof  of  Theorem  1.60.  ■ 


We  conclude  this  section  by  establishing  convergence  to  an  individual 
point. 


Proof  of  Proposition  1.64 ■  We  adopt  the  same  notation  as  above,  i.e.,  as  in 
the  proof  of  Theorem  1.60.  Since  F(k )  E  Fg((y.'),  the  set  of  sequence  points 
{w(kS)  |  k  E  N}  belong  to  the  convex  hull  of  all  the  components  of  the  initial 
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condition,  that  is,  [min,  Wi(0),  maxj  tCj(0)]n.  Since  [mini  t+(0),  max,  Wi(0)]n 

is  compact,  there  exists  a  convergent  subsequence  {w(ki5)  \  l  E  N}  to  a  point 

cl„.  We  also  notice  that  for  any  ki  E  N,  Wi((ki+k)5)  E  [min*  Wi(k[S),  max,;  Wi(ki5)]n, 

i  G  {1, . . . ,  n}  and  k  G  N.  Since  lim  w(kiS)  =  cln,  then  lim  [min  Wi(ki5),  max  Wi(ki5)]n 

l->-\-oc  l — >Too  i  i 

cln.  Therefore,  any  sequence  {w((ki  +  k)5)  \  k  G  N},  for  l  G  N,  must  con¬ 
verge  to  cl„.  This  implies  that  lim  w(k5)  =  cln.  ■ 

/c— >+oo 


1.8.6  Proofs  of  Theorems  1.76  and  1.77 

Proof  of  Theorem  1.76.  Let  us  prove  fact  (i).  Because  Tridn(a,  b,  a)  is  a  real 
symmetric  matrix,  Tridn(a,  b,  a)  is  normal  and  its  2-induced  norm,  i.e. ,  its 
largest  singular  value,  is  equal  to  the  magnitude  of  its  eigenvalue  with  largest 
magnitude.  Based  on  this  information  and  on  the  eigenvalue  computation 
in  Lemma  1.74,  we  compute 


Trid„(a,  b,  a)  || 2  =  max 


b  +  2a  cos 


<  161  +  2|a|  max 


cos 


11 r 

n  +  1 

ITT 


71+  1 


<  |6|  +  2\a\  cos 


7T 


n  +  1 


Because  we  assumed  |6|  +  2\a\  =  1  and  because  cos(^+)  <  1  for  any  n  >  2, 
the  2-induced  norm  of  Tridn(a,  6,  a)  is  strictly  less  than  1.  Additionally,  for 
t  >  0,  we  bound  from  above  the  magnitude  of  the  curve  x  as: 


\x 


2  =  ||  Tridn(a,  6,  a)lx0 1|2  <  (  |6|  +  2|a|  cos 


IT 

71  +  1 


2- 


To  have  ||x(^)||2  <  e||xo||2,  it  is  sufficient  that  £log  ^|6|  +  2|a|  cos  yn  ^  ^  J  J  < 
logs,  that  is, 


t> 


-log 


loge  1 

+  2|a|cos(r^T)) 


(1.8.2) 


The  upper  bound  now  follows  by  noting  that,  as  t  — >  0,  we  have 

log(l  —  2|a|(l  —  cost))  |a|f2  ^  ^ 

Let  us  now  show  the  lower  bound.  Assume  without  loss  of  generality  that 
ab  >  0  and  consider  the  eigenvalue  6  +  2acos(^q-j-)  of  Trid„(a,  6,  a).  Note 
that  |6  +  2a cos(^(j-j-)|  =  |6|  +  2|a|  cos(^j-j-).  (If  ab  <  0,  then  consider  the 
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eigenvalue  b  +  2acos(^A-).)  For  n  >  2,  define  the  unit-length  vector 


v„  = 


S^n  n+ 1 

J  2 

V  n  +  1 

_S^n  n+T. 

(1.8.3) 


and  note  that,  by  Lemma  1.74(i),  vn  is  an  eigenvector  of  Tridn(a,  b ,  a)  with 
eigenvalue  b  +  2acos(^qrj-).  The  trajectory  x  with  initial  condition  vn  sat¬ 
isfies  ||x(£)||2  =  (j&|  +  2\a\  cos  ||vn||2,  and  therefore,  it  will  enter 

B(ln,  e|| v^ll 2)  only  when  £  satisfies  equation  (1.8.2).  This  completes  the 
proof  of  fact  (i). 

Next,  we  consider  fact  (ii).  Clearly,  all  eigenvalues  of  the  matrix  Tridn(a,  b,  0) 
are  strictly  inside  the  unit  disk.  For  £  >  0,  we  compute 


Tridn(a,  b,  0)* 


71—1 


=  be(ln  +  -  Tridn(l,  0, 0))  =&*£- 


l\ 


a\j 


Tridn(l,  0, 0)J 


because  of  the  nilpotency  of  Tridn(l,  0, 0).  Now,  we  can  bound  from  above 
the  magnitude  of  the  curve  x  as 


\x(t)\\2  =  ||  Tridn(a,  b,  O+X0II2 

71—1 


< 


i^E  ■ 


i\ 


i=0 


j\(£-j)\\b 


a\J 


Tridn(l,  0,  0)-7x0|L  <  ea/h£n  1  |6|f  ||x0||2. 


Here  we  used  ||  Tridn(l,  0, 0)-?a;o  1 1 2  <  ll^olb  and  max{  ySjy  |  j  G  {0, . . .  ,n  — 
1}}  <  £n _1.  Therefore,  in  order  to  have  ||x(£)||2  <  e||xo||2,  it  suffices  that 
log(ea/b)  +  (n  —  1)  \og£  +  £log  |6|  <  logs,  that  is, 


n  —  1 


log£  > 


-  logs 


-  log  |&|  —  log  |6| 

A  sufficient  condition  for  £  —  a  \og£  >  (3,  for  a,@  >  0,  is  that  £  >  2/3  + 
2amax{l,loga}.  For,  if  l  >  2 a,  then  log£  is  bounded  from  above  by  the 
line  £/2 a  +  logo.  Furthermore,  the  line  £/2 a  +  logct  is  a  lower  bound 
for  the  line  (£  —  /3)/a  if  £  >  2/3  +  2a  log  a.  In  summary,  it  is  true  that 
||x(£)||2  <  e||x(0)||2  whenever 


£  >  2 


-  logs 


+  2 


n  —  1 


—  log  \  b\  —  log  |6| 


max  <  1,  log 


n  —  1 
—  log  \  b\ 


This  completes  the  proof  of  the  upper  bound,  that  is,  fact  (ii). 

The  proof  of  fact  (iii)  is  similar  to  that  of  fact  (i).  Because  Circn(a,  b,  c)  is 
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circulant,  it  is  also  normal  and  each  of  its  singular  values  corresponds  to  an 
eigenvector,  eigenvalue  pair.  From  Lemma  1.74(ii)  and  from  the  assumption 
a  +  b  +  c  =  1,  it  is  clear  that  the  eigenvalue  corresponding  to  i  =  n  is  equal 
to  1;  this  is  the  largest  singular  value  of  Circn(a,  b,  c )  and  the  corresponding 
eigenvector  is  ln.  We  now  compute  the  second  largest  singular  value: 


max 

iE{l,...,n— 1} 


i2n 


n 


b  +  (a  +  c)  cos  -  +  v  — l(c  —  a)  sin  - 


i2n 


n 


1  -  (a  +  c)  (l  -  cos  (^))  +  -  a)  sin  ^ 


2tt\ 
n  / 


Here  ||  ■  ||c  is  the  norm  in  C.  Because  of  the  assumptions  on  a,b,c,  the  second 
largest  singular  value  is  strictly  less  than  1.  In  the  orthogonal  decomposition 
induced  by  the  eigenvectors  of  Circ n(a,b,c),  we  assume  that  the  vector  yo 
has  a  component  yave  along  the  eigenvector  ln.  For  £  >  0,  we  bound  the 
distance  of  the  curve  y(£)  from  yave In  as 


£/aveln||2 

=  ||  Circn(a,  b,  c)ey0  -  yave  1„||2  =  ||  Circn(a,  b,  cY(y0  - 

2ns\ 
n 


< 


1  —  (a  +  c)  ^1  —  cos  (  —  )  j  +  y/—\ (c  —  a) 


/2tt\ 
sin 


V  n  ) 


Uscve^-n)  1 1 2 
t 

1 1 2/0  ?/aveln||2- 

C 


This  proves  that  lim^+00  y{£)  =  yaveln.  Also,  for  a  =  a  +  c,  /3  =  c  —  a  and 
as  t  — >  0,  we  have 


log((l-a(l-cost))2  +  /?2sin2t)1/2  ^ 


+  0(1). 


Here  (32  <  a  because  a,c£  ] 0, 1[.  From  this,  one  deduces  the  upper  bound 
in  (iii). 


Now,  consider  the  eigenvalues  \n  =  6+(a+c)  cos  )+"v/~T (c— a)  sin  (^) 
and  An  =  b+(a+c)  cos  +i/^T(c— a)  sin  j  Qf  Circn(a,  b,  c), 

and  its  associated  eigenvectors  (cf.  Lemma  1.74(h)) 


1 

1 

Vn  = 

LO 

€  C\ 

Vn  = 

a;™"1 

oon~l 

LO 

(1.8.4) 


Note  that  the  vector  v„  +  vn  belongs  to  Mn.  Moreover,  its  component  yave 
along  the  eigenvector  1„  is  0.  The  trajectory  y  with  initial  condition  vn  +  v„ 
satisfies  |jy(£)||2  =  ||Af)vn  +  Anv„||2  =  |A„|^||vn  +  v„||2,  and  therefore,  it  will 
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enter  i?(On,e||vn  +  v„ 1 1 2 )  only  when 


loge 


-1 


£>  - f— 

-log  1  -  (a  +  c)(l  -  cos  (^)J  +  ^(c-a)sin  (^) 
This  completes  the  proof  of  fact  (iii). 


Proof  of  Theorem  1.77.  We  prove  fact  (i)  and  observe  that  the  proof  of 
fact  (ii)  is  analogous.  Consider  the  change  of  coordinates 


x{£)  =  P+ 


xave(^) 

.  y(£)  . 


—  x'a ve(^)ln  +  P+ 


0 

y(f) 


where  xave(£)  G  M  and  y(£)  G  Rn_1.  A  quick  calculation  shows  that  xave(£)  = 
-l^x(£),  and  the  similarity  transformation  described  in  equation  (1.6.7) 
implies 


y(£  +  1)  =  Tridn_i(a,  b,  a)  y{£),  and  x'ave(£  +  1)  =  (6  +  2 a)x'ave(£). 


Therefore,  xave  =  x'ave.  It  is  also  clear  that 
x{£  +  1)  -  xave(£  +  l)ln 


=  P, 


0 

y(£  + !) 


=  R 


0  0 
0  Tridn_i(a,  b,  a) 


P+  )(x(£)  -  xave(£)ln). 


Consider  the  matrix  in  parenthesis  determining  the  trajectory  £  1— *•  (x(£)  — 
xa.ve (£) In)  ■  This  matrix  is  symmetric,  its  singular  values  are  0  and  the  singu¬ 
lar  values  of  Tridn_i(a,  b,  a),  and  its  eigenvectors  are  ln  and  the  eigenvectors 
of  Tridn_i  (a,  b,  a)  (padded  with  an  extra  zero).  These  facts  are  sufficient  to 
duplicate,  step  by  step,  the  proof  of  fact  (i)  in  Theorem  1.76.  Therefore,  the 
trajectory  £  1— (x(£)  —  xave(£)ln )  satisfies  the  stated  properties.  ■ 


1.9  EXERCISES 

El.l  (Orthogonal  and  permutation  matrices).  Prove  that 

(i)  the  set  of  orthogonal  matrices  is  a  group; 

(ii)  the  set  of  permutation  matrices  is  a  group;  and 

(iii)  each  permutation  matrix  is  orthogonal. 

El. 2  (Doubly  stochastic  matrices).  Show  that  set  of  doubly  stochastic  matrices  is 
convex  and  that  it  contains  the  set  of  permutation  matrices.  Find  in  the  literature 
as  many  distinct  proofs  of  Theorem  1.2  as  possible. 

Hint:  A  proof  is  contained,  in  [Horn  and  Johnson,  1985].  A  second  proof  method 
is  based  on  combinatorics  methods. 

El. 3  (Circulant  matrices).  Given  two  nxn  circulant  matrices  Ci  and  C2,  show  that 
the  following  hold: 
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(i)  Cfi  ,  C\  +  C2  and  C1C2  are  circulant;  and 

(ii)  C1C2  =  C2C1. 

El. 4  (Spectral  radius  and  oo-induced  norm  of  a  row-stochastic  matrix).  Show 

that  the  spectral  radius  and  the  oo-induced  norm  of  a  row-stochastic  matrix  is  1. 
Hint:  Let  A  £  Rdxd  be  stochastic.  First,  show  ||^4||oo  <  1  by  direct  algebraic  ma¬ 
nipulation.  Second,  use  the  bound  in  Lemma  1.6  to  show  that  p(A)  <  1.  Finally, 
conclude  the  proof  by  noting  that  1  is  an  eigenvalue  of  A. 

Hint:  An  alternative  proof  that  p(A)  =  1  is  as  follows.  First,  use  Gersgorin 
Disks  Theorem  1.3  to  show  that  spec(A)  is  contained  in  the  unit-disk  centered  at 
the  origin.  Second,  note  that  p(A)  >  1  since  1  is  an  eigenvalue  of  A. 

El. 5  (Positive  semidefinite  matrix  defined  by  a  doubly-stochastic  and  irre¬ 

ducible  matrix).  Let  A  £  Rnx"  be  doubly-stochastic  and  irreducible.  Show 
that  the  matrix 


In  -  ATA 


is  positive  semidefinite  and  its  eigenvalue  0  is  simple. 

El. 6  (M-matrices).  This  exercise  summarizes  some  properties  of  the  so-called  M- 

matrices;  see  [Fiedler,  1986].  A  matrix  A  £  R71*"  is  an  M-matrix  (resp.  an 
Mo-matrix)  if 

(i)  all  the  off-diagonal  elements  of  A  are  zero  or  negative;  and 

(ii)  there  exist  a  nonnegative  matrix  C  £  R™X71  and  k  >  p(C)  (resp.  k  > 
p(C))  such  that  A  =  kln  —  C. 

Show  that 


El. 7 


(i)  the  matrix  B  £  Rnx"  is  an  M-matrix  if 

(a)  all  the  off-diagonal  elements  of  B  are  zero  or  negative;  and 

(b)  there  exists  a  vector  v  £  R71  with  positive  entries  such  that  Bv 
has  positive  entries; 

(ii)  if  A  is  an  Mo-matrix,  irreducible  and  singular,  then  there  exists  x  £  R71 
with  positive  entries  such  that  Ax  =  0  and  rank(A)  =  n  —  1;  and 

(iii)  if  A  is  an  M-matrix,  then  all  eigenvalues  of  A  have  positive  real  part. 


(Decomposition  of  a  stochastic  matrix).  Consider  the  matrix 


Show  that 


'1  -1  0 

0  1  -1 


T  = 


0 


L  n 


1 


n 


0 


1 

1 

n 


0 ' 
0 


£  R 


nXn 


-1 


1 

n  - 


(i)  T  is  invertible. 

(ii)  For  a  stochastic  matrix  F  £  Rnx",  there  exist  Fe rr  £  r(71_1)x(71_1)  and 
Cerr  £  Rlx^71^1^  such  that 


TFT-1 


Fe  rr 

Cerr  1 
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Moreover,  if  F  is  symmetric,  then  cerr  =  0ix(n-i)- 

El. 8  (The  closed  map  defined  by  a  finite  collection  of  continuous  maps).  Let 

/i ,  •  •  • ,  fm  :  X  — >  X  be  continuous  functions,  where  A'  is  a  d-dimensional  space 
chosen  among  Rd,  Sd,  and  the  Cartesian  products  Rdl  x  §>d2 ,  for  some  di  +d2  =  d. 
Define  the  set-valued  map  T  :  X  =1  X  by 

T(x)  =  {/i  .,fm(x)}. 

Show  that  T  is  closed  on  X. 

Hint:  Reason  by  contradiction. 

El. 9  (Overapproximation  Lemma).  Prove  Lemma  1.21. 

El.  10  (Acyclic  digraphs).  Let  G  be  an  acyclic  digraph.  Show  that: 

(i)  G  contains  at  least  one  source,  i.e.,  a  vertex  without  in-neighbors; 

(ii)  G  contains  at  least  one  sink,  i.e.,  a  vertex  without  out-neighbors;  and 

(iii)  in  an  appropriate  ordering  of  the  vertices  of  G,  the  adjacency  matrix  A 
is  lower-triangular,  i.e.,  all  its  entries  above  the  main  diagonal  vanish. 
Hint:  Order  the  vertices  of  G  according  to  their  distance  to  a  sink. 

El.  11  (A  sufficient  condition  for  a  matrix  to  be  primitive).  Show  that  if  A  € 
RnXn  is  nonnegative,  irreducible,  and  has  a  positive  element  on  the  diagonal,  then 
A  is  primitive.  Give  an  example  that  shows  that  this  condition  is  sufficient  but  not 
necessary,  i.e.,  find  a  primitive  matrix  with  no  positive  element  on  the  diagonal. 
Hint:  See  Exercise  El. 22  below  for  a  candidate  matrix. 

El. 12  (Condensation  digraph).  This  exercise  studies  the  decomposition  of  a  digraph 
G  in  its  strongly  connected  components.  A  subgraph  H  is  a  strongly  connected 
component  of  G  if  H  is  strongly  connected  and  any  other  subgraph  of  G  strictly 
containing  H  is  not  strongly  connected.  The  condensation  digraph  of  G,  denoted 
C(G),  is  defined  as  follows:  the  nodes  of  C(G)  are  the  strongly  connected  compo¬ 
nents  of  G,  and  there  exists  a  directed  edge  in  C(G)  from  node  H i  to  node  H2  if 
and  only  if  there  exists  a  directed  edge  in  G  from  a  node  of  Hi  to  a  node  of  LTj- 
Show  that: 

(i)  every  condensation  digraph  is  acyclic; 

(ii)  a  digraph  contains  a  globally  reachable  node  if  and  only  if  its  condensa¬ 
tion  digraph  contains  a  globally  reachable  node;  and 

(iii)  a  digraph  contains  a  directed  spanning  tree  if  and  only  if  its  condensation 
digraph  contains  a  directed  spanning  tree. 

El.  13  (Incidence  matrix).  Given  a  weighted  digraph  G  of  order  n,  choose  an  arbi¬ 
trary  ordering  of  its  edges.  Define  the  incidence  matrix  H(G)  £  RlslX71  of  G  by 
specifying  that  the  row  of  H(G)  corresponding  to  edge  (i,j)  has  an  entry  1  in 
column  i.  an  entry  —1  in  column  j,  and  all  other  entries  equal  to  zero.  Show  that 

H(G)TWH{G )  =  L(G)  +  L(rev(G)), 

where  W  G  rIbIxIbI  is  the  diagonal  matrix  with  in  the  entry  corresponding  to 
edge  (i,j). 

El.  14  (From  digraphs  to  stochastic  matrices  and  back).  Let  G  be  a  weighted  di¬ 
graph  of  order  n  with  adjacency  matrix  A,  out-degree  matrix  Dout,  and  Laplacian 
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matrix  L.  Define  the  following  matrices: 

F1  =  (k/„  +  Douty\nIn  +  A),  for  k  £  R>0, 

F2  =  In  -  eL,  for  e  £  [0,  min{(.DoUt)^1  |  i  €  {1, . . . ,  n}}[. 


Perform  the  following  tasks: 

(i)  Compute  the  entries  of  F\  and  F2  as  a  function  of  the  entries  of  A(G)\ 

(ii)  show  that  the  matrices  F\  and  F2  are  row-stochastic; 

(iii)  identify  the  least  restrictive  conditions  on  G  such  that  the  matrices  F\ 
and  F2  are  doubly  stochastic;  and 

(iv)  under  what  conditions  can  a  row-stochastic  matrix  be  written  in  the  form 
Fi ,  or  F2  for  some  appropriate  digraph  (and  for  some  appropriate  scalars 
k  and  e)? 

El. 15  (Metropolis-Hastings  weights  from  the  theory  of  Markov  chains).  Given 
an  undirected  graph  G  of  order  n,  define  a  weighted  adjacency  matrix  A  with 
entries 


1 

"u  l  +  max{|A/'(i)|,|A/'(j)|}’ 

for  (i,  j)  £  E.  Perform  the  following  tasks: 

(i)  Show  that  the  weighted  degree  of  any  vertex  is  strictly  smaller  than  1; 

(ii)  use  (i)  to  justify  that  e  =  1  can  be  chosen  in  Exercise  El. 14  for  the 
construction  of  the  matrix  F2;  and 

(iii)  express  the  exponential  convergence  factor  rexp(F2)  as  a  function  of  the 
eigenvalues  of  the  Laplacian  of  G. 

El.  16  (Some  properties  of  products  of  stochastic  matrices).  Show  that: 

(i)  if  the  matrices  Ai,...,Ak  are  nonnegative,  row-stochastic,  or  doubly 
stochastic,  respectively,  then  their  product  A\  ■  ■  ■  Ak  is  non-negative,  row- 
stochastic,  or  doubly  stochastic,  respectively; 

(ii)  if  the  nonnegative  matrices  Ai,...,Ak  have  strictly  positive  diagonal 
elements,  then  their  product  Ai  ■  ■  ■  Ak  has  strictly  positive  diagonal  ele¬ 
ments;  and 

(iii)  let  Gi, . . .  ,Gk  be  digraphs  associated  with  the  nonnegative  matrices 

A\ , . . . ,  Ak  with  strictly  positive  diagonal  elements.  If  the  digraph  Gi  U . . .  U  Gk 
is  strongly  connected,  then  the  matrix  Ai  ■  ■  ■  Ak  is  irreducible. 

El.  17  (Disagreement  function).  The  quadratic  form  associated  with  a  symmetric 
matrix  B  £  Rnxn  is  the  function  x  i— >  xT Bx.  Given  a  digraph  G  of  order  n,  the 
disagreement  function  <£>g  :  R™  — >  R  is  defined  by 

1  " 

$g(®)  =  g  5Z  ay(®j  -  xif-  (EL1) 

i,j= 1 


Show  the  following  are  true: 
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(i)  The  disagreement  function  is  the  quadratic  form  associated  with  the  sym¬ 
metric  positive-semidefinite  matrix 

P(G)  =  i (Amt (G)  +  An(G)  -  A(G)  -  A(G)T); 

(ii)  P(G)  =  l(L(G)  +  L(rev(G))). 

Hint:  A  sample  proof  is  provided  in  [Gao  et  al,  2008]. 

El.  18  (Weight-balanced  graphs  and  connectivity).  Let  G  be  a  weighted  digraph 
and  let  A  be  a  nonnegative  n  x  n  matrix.  Show  the  following  statements: 

(i)  If  G  is  weight-balanced  and  contains  a  globally  reachable  node,  then  it 
is  strongly  connected; 

(ii)  if  A  is  doubly  stochastic  and  its  associated  weighted  digraph  contains  a 
globally  reachable  node,  then  its  associated  weighted  digraph  is  strongly 
connected;  and 

(iii)  if  A  is  doubly  stochastic  and  a  column  of  is  positive,  then 

5Zfc=o  Ak  is  Positive. 

El.  19  (The  Laplacian  matrix  is  positive  semidefinite).  Without  relying  on  the 
Gersgorin  Disks  Theorem  1.3,  show  that,  if  the  weighted  digraph  G  is  undirected, 
then  the  matrix  L(G )  is  symmetric  positive  semidefinite.  (Note  that  the  proof  of 
statement  (i)  in  Theorem  1.34  relies  on  Gersgorin  Disks  Theorem  1.3). 

El. 20  (Properties  of  the  BFS  algorithm).  Prove  Lemma  1.25. 

El. 21  (LCR  algorithm).  Consider  the  following  LCR  algorithm  for  leader  election: 

(i)  Give  a  UID  assignment  to  each  processor  for  which  0(n2)  messages  are 
sent;  and 

(ii)  give  a  UID  assignment  to  each  processor  for  which  only  O(n)  messages 
are  sent. 

(iii)  Show  that  the  average  number  of  messages  sent  is  0(n  log  n),  where  the 
average  is  taken  over  all  possible  ordering  of  the  processors  on  the  ring, 
each  ordering  assumed  to  be  equally  likely. 

El. 22  (Properties  of  a  stochastic  matrix  and  its  associated  digraph).  Consider 
the  stochastic  matrices 


Ai 


0  1  1 
1  0  1 
1  1  0 


and  A2  =  — 


110  0 
0  0  11 
110  0 
0  0  11 


Define  and  draw  the  associated  digraphs  Gi  and  G2.  Without  relying  on  the 
characterization  in  Propositions  1.30  and  1.32,  perform  the  following  tasks: 

(i)  Show  that  the  matrices  A\  and  A2  are  irreducible  and  that  the  associated 
digraphs  Gi  and  G 2  are  strongly  connected; 

(ii)  show  that  the  matrices  A\  and  A2  are  primitive  and  that  the  associated 
digraphs  Gi  and  G2  are  strongly  connected  and  aperiodic;  and 

(iii)  show  that  the  averaging  algorithm  associated  with  A2  converges  in  a 
finite  number  of  steps. 
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El. 23  (Compactness  of  the  set  of  non-degenerate  matrices  with  respect  to  a 
parameter).  Show  that,  for  any  a  £  ]0, 1],  the  set  of  non-degenerate  matrices 
with  respect  to  a  is  compact. 

El. 24  (Laplacian  flow  [Olfati-Saber  and  Murray,  2004]).  Let  G  be  a  weighted 
directed  graph  with  a  globally  reachable  node.  Define  the  Laplacian  flow  on  R71 
by 


x  =  — L(G)x , 

or,  equivalently  in  components, 

ii  =  ^2  aij(xj  —  Xi),  i€{l,...,n}. 

j6Afout(i) 

Answer  the  following  questions: 

(i)  What  are  the  equilibrium  points? 

(ii)  show  that,  if  G  is  undirected,  then  the  disagreement  function  (see  Exer¬ 
cise  El. 17)  is  monotonically  non-increasing  along  the  Laplacian  flow; 

(iii)  given  xo  —  ((xo)i,  . . . ,  (®o)n)  £  Rn,  show  that  the  solution  t  hh >  x(t)  of 
the  Laplacian  flow  starting  at  xo  verifies 

min{(a;o)i, . . . ,  (®o)™}  <  xfit)  <  max{(i0)i, . . . ,  (*o)n}, 

for  all  t  £  R>o-  Use  this  fact  to  deduce  that  the  solution  t  >— >  x{t)  is 
bounded; 

(iv)  for  G  undirected,  use  (i)-(iii)  to  apply  the  LaSalle  Invariance  Principle  in 
Theorem  1.17  and  show  that  the  solutions  of  the  Laplacian  flow  converge 
to  diag(R"); 

(v)  find  an  example  G  such  that,  with  the  notation  in  Exercise  El. 17,  the 
symmetric  matrix  L(G)T  P(G)  +  P(G)L(G)  is  indefinite;  and 

Hint:  To  show  that  the  matrix  is  indefinite,  it  suffices  to  find  xi,X2  £ 

R"  such  that  xi  (L(G)tP(G)  +  P(G)L(G))x i  <  0  and  xi (L(G)TP{G)  + 
P(G)L(G))x 2  >  0. 

(vi)  show  that  the  Euler  discretization  of  the  Laplacian  flow  is  the  Laplacian- 
based  averaging  algorithm. 

El. 25  (Log-Sum-Exp  consensus  [Tahbaz-Salehi  and  Jadbabaie,  2006]).  Pick 
a£l\{0}  and  define  the  function  fa  :  Rn  — >  R  by 

1  n 

fa{x)  =  Q.  log 

1  i=  1 


Show  that 

(i)  lim  fa(x)  =  minjii, . . . ,  x„}  and  lim  fa(x)  =  max{ii, . . . ,  xn}',  and 

a— >0+ 

(ii)  lim  fa(x)=  lim  fa(x)  =  —  (aq  H - Vxn). 

OL — >  +  00  Ot — >  —  OO  XI 

Let  A  £  Rnxn  be  a  non-degenerate,  doubly  stochastic  matrix  whose  associated 
digraph  contains  a  globally  reachable  node.  Consider  the  discrete-time  dynamical 
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system 


n 

Wi(£+  1)  =  a  log  W/al  )  . 

2=1 


(iii)  Show  that  w(f)  — >  fa(w(0))l„  as  l  — >  +oo. 

El. 26  (The  theory  of  Markov  chains  and  random  walks  on  graphs).  List  as 
many  connections  as  possible  between  the  theory  of  averaging  algorithms  discussed 
in  Section  1.6.2  and  the  theory  of  Markov  chains.  Some  relevant  references  on 
Markov  chains  include  [Seneta,  1981,  Lovasz,  1993]. 

Hint:  There  is  a  one-to-one  correspondence  between  averaging  algorithms  and 
Markov  chains.  A  homogeneous  Markov  chains  corresponds  precisely  to  a  time- 
independent  averaging  algorithm.  A  reversible  Markov  chain  corresponds  precisely 
to  a  symmetric  stochastic  matrix. 

El. 27  (Distributed  hypothesis  testing  [Rao  and  Durrant-Whyte,  1993,  Olfati-Saber  et  al., 
2006]).  Let  hy,  for  7  £  F  in  a  finite  set  T,  be  a  set  of  alternative  hypotheses  about 
an  uncertain  event.  Suppose  that  n  nodes  take  measurements  Zi,  for  i  £  {1, . . . ,  n}, 
related  to  the  event.  Assume  that  each  observation  is  conditionally  independent 
of  all  other  observations,  given  any  hypothesis. 

(i)  Using  Bayes’  Theorem  and  the  independence  assumption,  show  that  the 
a  posteriori  probabilities  satisfy 

p(h-f]zi,...,Zn)  =  P^7-> - r  JJp(Zi|/l7). 

P\z  1  !■■■■,  zn) 

(ii)  Suppose  the  nodes  form  a  undirected  unweighted  connected  synchronous 
network  with  adjacency  matrix  A.  Consider  the  discrete-time  dynamical 
system 


7Ti(f+l)  =  ( TVi(e )  IK- 
2=1 


1/ (l  +  ^out  (*)) 


Fix  7  £  F,  set  7Ti(0)  =  p{zi\h1),  and  show  that  n(£) 
as  l  — >  +00. 


n 


p(.Zi  |  h~y  )  In 


(iii)  What  information  does  each  node  need  in  order  to  compute  the  maximum 
a  posteriori  estimate,  that  is,  to  estimate  the  most  likely  hypothesis? 

Hint:  Can  you  compute  p(zi, . . .  ,zn),  given  knowledge  of  p(h~y)  and  of 

Yl?=iP(zi\h-t)? 

As  a  bibliographical  note,  the  variable  nt  is  referred  to  as  the  belief  in  the  seminal 
work  by  Pearl  [1988]. 

El. 28  (Bounds  on  vector  norms).  Prove  Lemma  1.79. 

El. 29  (The  “n-bugs  problem”  and  cyclic  interactions).  The  “n-bugs  problem” 
related  to  the  pursuit  curves  from  mathematics,  inquires  about  what  are  the  paths 
of  n  bugs,  not  aligned  initially,  when  they  chase  one  another.  Simple  versions  of  the 
problem  (e.g.,  for  three  bugs  starting  at  the  vertices  of  an  equilateral  triangle)  were 
studied  as  early  as  the  19th  century.  It  was  in  [Watton  and  Kydon,  1969]  when  a 
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general  solution  for  the  general  n  bugs  problem  for  non-collinear  initial  positions 
was  given.  The  bugs  trace  out  logarithmic  spirals  that  eventually  meet  at  the 
same  point,  and  it  is  not  necessary  that  they  move  with  constant  velocity.  Surveys 
about  cyclic  pursuit  problems  are  given  in  the  papers  in  [Watton  and  Kydon,  1969, 
Marshall  et  ah,  2004],  Cyclic  pursuit,  has  also  been  recently  studied  in  the  multi¬ 
agent  and  control  literature,  e.g.,  see  [Bruckstein  et  ah,  1991,  Marshall  et  ah,  2004, 
Smith  et  ah,  2005].  In  particular,  the  paper  [Marshall  et  ah,  2004]  extends  the  71- 
bugs  problem  to  the  case  of  n  kinematic  unicycles  evolving  in  continuous  time. 

Consider  the  simplified  scenario  of  the  n-bugs  problem  placed  on  a  circle  of 
radius  r  and  suppose  that  the  bugs’  motion  is  constrained  to  be  on  that  circle. 
Assume  that  agents  are  ordered  counterclockwise  with  identities  i  £  {1, . . .  ,n}, 
where  we  identify  for  convenience  n  +  1  with  1.  Denote  by  Pi{£)  =  ( r,8i(£ ))  the 
sequence  of  positions  of  bug  i,  initially  at  Pi(0)  =  (r,  #;(0)). 

Cyclic  pursuit.  Suppose  that  each  bug  is  chasing  the  closest  counterclockwise 
neighbor  (according  to  the  order  we  have  given  them  on  the  circle),  see 
Figure  El. 1(a).  In  other  words,  each  bug  feels  an  attraction  towards  the 
closest  counterclockwise  neighbor  that  can  be  described  by  the  equation 

8i(£  +  1)  =  (1  —  k)6i(l)  +  k0i+i(l),  £  £  Z>0, 

where  k  £  [0, 1].  Determine  for  which  values  of  k  the  bugs  converge  to  a 
configuration  for  which  distc(#i+i,  #;)  =  distc(#i,  0;-i)  for  all*  £  {1, ... ,  n}. 
Observe  that  the  bugs  will  approach  this  equally-spaced  configuration  while 
moving  around  the  circle  indefinitely. 

Cyclic  balancing.  Suppose  that  each  bug  makes  a  compromise  between  chasing 
its  closest  counterclockwise  neighbor  and  the  closest  clockwise  neighbor,  see 
Figure  El. 1(b).  In  other  words,  each  bug  feels  an  attraction  towards  the 
closest  counterclockwise  and  clockwise  neighbors  that  can  be  described  by 
the  equation 

0i{£  -f-  1)  =  k6i+ i (€)  +  (1  —  2k')6i(£')  +  k8i— 1(^),  £  £  ^>o, 
where  k  £  [0, 1]. 

(i)  Determine  for  which  values  of  k  the  bugs  converge  to  a  configuration 
for  which  distc(0i+i,  8i)  =  distc(0;,  #i-i)  for  all  i  £  {1, ... ,  n}. 

(ii)  Show  that  the  bugs  will  approach  this  equally-spaced  configuration 
while  each  of  them  converges  to  a  stationary  position  on  the  circle. 

Hint:  Rewrite  the  cyclic  pursuit  and  cyclic  balancing  systems  in  terms  of  the  inter¬ 
bug  distances,  that  is,  in  terms  of  di{£)  =  distc(#i  +i (£),8i(£)),  i  £  {l,...,n},  £  £ 
Z>o .  Find  the  matrices  that  describe  the  linear  iterations  in  these  new  coordinates. 
Show  that  the  agreement  space,  i.e.,  the  diagonal  set  in  Rn,  is  invariant  under 
the  dynamical  systems.  Finally,  determine  which  values  of  k  make  each  system 
converge  to  the  agreement  space.  Lemma  1.74  might  be  of  use  in  this  regard. 
Regarding  part  (ii)b),  recall  that  an  exponentially  decaying  sequence  is  summable. 


91 


“Distributed  Control  of  Robotic  Networks”  by  F.  Bullo,  J.  Cortes  and  S.  Martinez 
Copyright  ©  2006-2008.  Manuscript  under  contract.  This  version:  October  27,  2008 


DCRN  October  27,  2008 


(a)  (b) 

Figure  El.l  Illustration  of  the  n-bugs  problem.  In  (a),  agent  i  looks  at  the  position  of 
agent  i  + 1  and  moves  toward  it  by  an  amount  proportional  to  their  distance. 
In  (b),  agent  i  looks  at  the  position  of  agents  i  +  1  and  i  —  1  and  moves 
toward  the  one  which  is  furthest  by  an  amount  proportional  to  the  difference 
between  the  two  distances.  In  both  cases,  the  constant  of  proportionality 
is  k. 
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Chapter  Two 

Geometric  models  and  optimization 


This  chapter  presents  various  geometric  objects  and  geometric  optimiza¬ 
tion  problems  that  have  strong  connections  with  motion  coordination.  Ba¬ 
sic  geometric  notions  such  as  polytopes,  centers,  partitions,  and  distances 
are  ubiquitous  in  cooperative  strategies,  coordination  tasks,  and  the  inter¬ 
action  of  robotic  networks  with  the  physical  environment.  The  notion  of 
Voronoi  partition  finds  application  in  diverse  areas  such  as  wireless  com¬ 
munications,  signal  compression,  facility  location,  and  mesh  optimization. 
Proximity  graphs  provide  a  natural  way  to  mathematically  model  the  net¬ 
work  interconnection  topology  resulting  from  the  agents’  sensing  and/or 
communication  capabilities.  Finally,  multicenter  functions  play  the  role  of 
aggregate  objective  functions  in  geometric  optimization  problems.  We  in¬ 
troduce  these  concepts  here  in  preparation  for  the  later  chapters. 

The  chapter  is  organized  as  follows.  We  begin  by  presenting  basic  geo¬ 
metric  constructions.  This  gives  way  to  introduce  the  notion  of  proximity 
graphs  along  with  numerous  examples.  The  next  section  of  the  chapter 
presents  geometric  optimization  problems  and  multicenter  functions,  pay¬ 
ing  special  attention  to  the  characterization  of  their  smoothness  properties 
and  critical  points.  We  end  the  chapter  with  three  sections  on,  respectively, 
bibliographical  notes,  proofs  of  the  results  presented  in  the  chapter,  and 
exercises. 

2.1  BASIC  GEOMETRIC  NOTIONS 

In  this  section,  we  gather  some  classical  geometric  constructions  that  will 
be  invoked  profusely  throughout  the  book. 


2.1.1  Polygons  and  polytopes 

For  p,q  €  we  let  ]p,  q[=  {Xp  +  (1  —  A )q  |  A  G  ]0, 1[}  and  [p,  q]  =  {Ap+  (1  — 
A )q  |  A  G  [0, 1]}  denote  the  open  and  closed  segment  with  extreme  points  p 
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and  q,  respectively.  We  let  Hp^q  =  {i  £  Rd  ||x  —  p| 1 2  <  ||x  —  q || 2}  denote 
the  closed  halfspace  of  Rd  of  points  closer  (in  Euclidean  distance)  to  p  than 
to  q.  In  the  plane,  we  often  refer  to  a  halfspace  as  a  halfplane. 


As  seen  in  Section  1.2,  a  set  S  C  Rd  is  convex  if,  for  any  two  points  p,  q 
in  S,  the  closed  segment  \p,  q]  is  contained  in  S.  The  convex  hull  of  a  set 
is  the  smallest  (with  respect  to  the  inclusion)  convex  set  that  contains  it. 
We  denote  the  convex  hull  of  S  by  co(S).  For  S  =  {p\, . . .  ,pn}  finite,  the 
convex  hull  can  be  explicitly  described  as 

n 

CO (S)  =  { A1P1  H - b  A npn  I  Xi  >  0  and  ^  Xj  =  l}. 

i= 1 

Given  p  and  q  in  Rfi  and  a  convex  closed  set  Q  C  Wl  with  p  e  Q  (see 
Figure  2.1),  define  the  from-to-inside  function  by 


fti(p,  q,  Q ) 


q,  if  q  S  Q, 

\p,q]ndQ,  if  qfLQ. 


Figure  2.1  Illustration  of  the  from-to-inside  function  fti. 


The  function  fti  selects  the  point  in  the  closed  segment  [p,  q]  which  is 
at  the  same  time  closest  to  q  and  inside  Q.  Note  that  fti(p,  q,  Q)  depends 
continuously  on  p  and  q. 

A  polygon  is  a  set  in  R2  whose  boundary  is  the  union  of  a  finite  number  of 
closed  segments.  A  polygon  is  simple  if  its  boundary,  regarded  as  a  curve, 
is  non  self-intersecting.  We  will  only  consider  simple  polygons.  The  closed 
segments  composing  the  boundary  of  a  polygon  are  called  edges,  and  points 
resulting  from  the  pairwise  intersection  between  consecutive  edges  are  called 
vertices.  A  convex  polygon  can  be  written  as 

(i)  the  convex  hull  of  its  set  of  vertices;  or 

(ii)  the  intersection  of  halfplanes  defined  by  its  edges. 
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Two  vertices  whose  open  segment  is  contained  in  the  interior  of  the  polygon 
define  a  diagonal.  To  each  vertex  of  a  polygon  we  associate  an  interior  and 
an  exterior  angle.  A  vertex  is  strictly  convex  (resp.  strictly  nonconvex )  if  its 
interior  angle  is  strictly  smaller  (resp.  greater)  than  n  radians.  A  polygon 
is  nonconvex  if  it  has  at  least  one  strictly  concave  vertex.  The  perimeter  of 
a  polygon  is  the  length  of  its  boundary,  i.e.,  the  sum  of  the  lengths  of  its 
edges. 

A  polytope  is  the  generalization  of  the  notion  of  polygon  to  M.d,  for  d  >  3. 
In  this  book,  we  will  not  consider  nonconvex  polytopes  in  dimension  larger 
than  2.  As  for  convex  polygons,  a  (convex)  polytope  in  M0*  can  be  defined 
as  either  the  convex  hull  of  a  finite  set  of  points  in  Rd  or  the  bounded 
intersection  of  finite  set  of  halfspaces.  A  d—  1  face  (or  a  facet)  of  a  polytope  is 
the  intersection  between  the  polytope  and  the  boundary  of  a  closed  halfspace 
that  defines  the  polytope.  A  d— 2  face  is  a  d— 2  face  of  a  facet  of  the  polytope. 
The  faces  of  dimensions  0,  1,  and  d—1  are  called,  respectively,  vertices,  edges, 
and  facets.  For  a  convex  polytope  Q,  we  will  refer  to  them,  respectively,  by 
Ve(Q),  Ed (Q),  and  Fa(Q). 


2.1.2  Nonconvex  geometry 

In  this  section,  we  gather  some  basic  notions  on  nonconvex  geometry.  We 
consider  environments  that  include  nonconvex  polygons  as  a  particular  case. 

We  begin  with  some  visibility  notions.  Given  S  C  Rd,  two  points  p,q  E  S 
are  visible  to  each  other  if  the  closed  segment  \p,  q ]  is  contained  in  S.  The 
visibility  set  Vi (p;  S)  is  the  set  of  all  points  in  S  visible  from  p.  Given  r  >  0, 
the  range-limited  visibility  set  Vidisk(p;  S)  =  Vi(p;  S)  n  B(p,  r)  is  the  set  of 
all  points  in  S  within  a  distance  r  and  visible  from  p.  The  set  S  is  star-shaped 
if  there  exists  p  E  S  such  that  Vi  (p;  S )  =  S.  The  kernel  set  of  S  is  comprised 
of  all  the  points  with  this  property,  i.e.,  kernel  (S)  =  {p  E  S  |  Vi(p;  S)  =  S}. 
Trivially,  any  convex  set  is  star-shaped.  Given  6  E  R>o,  the  ^-contraction 
of  S  is  the  set  Sg  =  {p  E  S  |  dist (p,dS)  >  d}.  Note  that,  if  two  points 
p,q  £  S  are  visible  to  each  other  in  Sg,  then  any  point  within  distance  5  of 
p  and  any  point  within  distance  <5  of  q  are  visible  to  each  other.  Figure  2.2 
illustrates  these  visibility  notions. 

Next,  we  introduce  various  concavity  notions.  Given  S  C  Rd  connected 
and  closed,  p  G  dS  is  strictly  concave  if  for  any  e  G  M>o,  there  exist  q\ ,  q‘>  G 
B(p,  e)  n  dS  such  that  [qi,  <72]  ft-  S.  This  definition  coincides  with  the  notion 
of  strictly  concave  vertex  when  the  set  S’  is  a  polygon.  A  strict  concavity 
of  S  is  either  an  isolated  strictly  concave  point  or  a  concave  arc,  i.e.,  a 
connected  set  of  strictly  concave  points.  An  allowable  environment  S  C  R2 
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Figure  2.2  Illustration  of  various  visibility  notions.  The  visibility  set  Vi(pi;  S)  from  pi  in 
S,  the  visibility  set  Vi(pi;Sj)  from  pi  in  Ss,  and  the  range-limited  visibility 
set  Vidisk(p2;  S)  from  p2  in  S  are  depicted  in  light  gray.  The  dashed  curve  in 
the  interior  of  S  corresponds  to  the  boundary  of  the  5-contraction  of  S.  The 
points  P2  and  qi  are  visible  to  each  other  in  Ss-  The  points  qi  and  q2  are  visible 
to  each  other  in  S,  but  they  are  not  visible  to  each  other  in  Ss- 


is  a  set  that  satisfies  the  following  properties:  it  is  closed,  simply  connected, 
has  a  finite  number  of  strict  concavities  and  its  boundary  can  be  described 
by  a  continuous  and  piecewise  continuously  differentiable  curve  which  is  not 
differentiable  at  most  at  a  finite  number  of  points.  Figure  2.3  shows  a  sample 
allowable  environment.  Given  an  allowable  environment  S,  let  v  belong  to  a 
concave  arc  and  such  that  the  boundary  of  S  is  continuously  differentiable 
at  v.  The  internal  tangent  halfplane  Hg(v)  is  the  closed  halfplane  whose 
boundary  is  tangent  to  dS  at  v  and  whose  interior  does  not  contain  any 
points  of  the  strict  concavity;  see  Figure  2.3. 

The  following  result  presents  an  interesting  property  of  allowable  environ¬ 
ments.  Its  proof  is  left  to  the  reader. 

Lemma  2.1  (Contraction  of  allowable  environments).  Given  an  al¬ 
lowable  environment  S,  the  5-contraction  Ss  is  also  allowable  for  sufficiently 
small  5  £  M>o  and  does  not  have  isolated  strictly  concave  points.  Further¬ 
more,  the  boundary  of  Ss  is  continuously  differentiable  at  the  concavities. 

Lemma  2.1  implies  that  the  internal  tangent  halfplane  is  well-defined  at 
any  strict  concavity  of  the  ^-contraction  Ss- 

A  set  S  C  X  is  relatively  convex  in  X  C  if,  for  any  two  points  p, 
q  in  S,  the  shortest  curve  in  X  that  connects  p  and  q  is  contained  in  S. 
Relatively  convex  sets  in  are  just  convex  sets.  The  relative  convex  hull 
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Figure  2.3  An  allowable  environment  S.  The  curved  portion  of  the  boundary  is  a  concave 
arc.  The  vertices  whose  interior  angle  is  3ir/2  radians  are  isolated  strictly 
concave  points.  The  relative  convex  hull  of  {pi, . . .  ,pe}  in  S  is  depicted  in 
light  gray.  Finally,  the  dashed  line  represents  the  boundary  of  the  internal 
tangent  halfplane  Hs{v)  tangent  to  dS  at  v. 


of  a  set  S  in  X  is  the  smallest  (with  respect  to  the  operation  of  inclusion) 
relatively  convex  set  in  X  that  contains  S ,  see  Figure  2.3.  We  denote  the 
relative  convex  hull  of  S  in  X  by  rco(S';X).  The  (relative)  perimeter  of  S 
in  X  is  the  length  of  the  shortest  measurable  closed  curve  contained  in  X 
that  encloses  S. 


2.1.3  Geometric  centers 

Let  X  =  X  =  §d  or  X  =  x  §d2,  d  =  d\  +  cfe.  Recall  our  convention 
(cf.  Section  1.1.3)  that,  unless  otherwise  noted,  Md  is  endowed  with  the 
Euclidean  distance,  is  endowed  with  the  geodesic  distance,  and  Mdl  x  S^2 
is  endowed  with  the  Cartesian  product  distance  (dist2,  distg). 

The  circumcenter  of  a  bounded  set  S  C  X,  denoted  by  CC(S),  is  the  cen¬ 
ter  of  the  closed  ball  of  minimum  radius  that  contains  S.  The  circumradius 
of  S,  denoted  by  CR(S'),  is  the  radius  of  this  ball1.  The  circumcenter  is 
always  unique. 

The  computation  of  the  circumcenter  and  the  circumradius  of  a  polytope 
Q  C  M.d  is  a  strictly  convex  problem  and  in  particular  a  quadratically  con¬ 
strained  linear  program  in  p  (the  center)  and  r  (the  radius).  It  consists  of 
minimizing  the  radius  r  of  the  ball  centered  at  p  subject  to  the  constraints 
that  the  distance  between  q  and  each  of  the  polygon  vertices  is  smaller  than 


1Note  that  the  definition  of  circumcenter  given  here  is  in  general  different  from  the  classical 
notion  of  circumcenter  of  a  triangle,  i.e.,  the  center  of  the  circle  passing  through  the  three  vertices 
of  the  triangle. 
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or  equal  to  r.  Formally,  the  problem  can  be  expressed  as 


minimize  r , 


subject  to  || q  —  p\\\  <  r2,  for  all  q  E  Xe(Q).  (2.1.1) 


Next,  we  summarize  some  useful  properties  of  the  circumcenter  in  Euclidean 
space,  see  Exercise  E2.1  for  their  proofs.  In  the  following  result,  for  S  E 
F(Md)  with  d  =  1,  we  let  Ve(co(5))  denote  the  set  of  extreme  points  of  the 
interval  co (S). 

Lemma  2.2  (Properties  of  the  circumcenter  in  Euclidean  space). 

Let  S  =  {pi, . . .  ,pn}  E  F(Rd)  with  n  >  2.  The  following  properties  hold: 

(i)  CC{S)  E  co(S')  \  Ve(co(S));  and 

(ii)  if  p  E  co(S')  \  {CC(S’)}  and  r  E  M>o  are  such  that  S  C  B(p,r), 

then  ]p,  CC(<S)[  has  a  nonempty  intersection  with  f)  for  all 

q  E  co(5). 


Figure  2.4  Circumcenter  and  circumradius  (left),  and  incenter  and  inradius  (right)  of  a 


convex  polygon. 


Given  X  =  Md,  X  =  §>d  or  X  =  Mdl  x  §da,  d  =  d\  +  cfe,  the  incenter 
or  Chebyshev  center  of  a  compact  set  S  C  X,  denoted  by  IC(5),  is  the  set 
containing  the  centers  of  all  closed  balls  of  maximum  radius  contained  in  S. 
The  inradius  of  S,  denoted  by  IR(S'),  is  the  common  radius  of  any  of  these 
balls. 

The  computation  of  the  incenter  and  the  inradius  of  a  polytope  Q  C  Wl  is 
a  convex  problem,  and  in  particular  a  linear  program  in  p  and  r.  It  consists 
of  maximizing  the  radius  r  of  the  ball  centered  at  p  subject  to  the  constraints 
that  the  distance  between  p  and  each  of  the  polytope  facets  is  greater  than 
or  equal  to  r.  Formally,  the  problem  can  be  expressed  as  follows.  For  each 
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/  £  Fa(Q),  select  a  point  qf  £  Q  belonging  to  /.  Then,  we  set 
maximize  r , 

subject  to  (p  —  qf)  ■  ninj  >  r  ,  for  all  /  £  Fa(Q),  (2.1.2) 

where  nnij  denotes  the  normal  to  the  face  /  pointing  toward  the  interior  of 
the  polytope.  The  incenter  of  a  polytope  is  not  necessarily  unique  (consider, 
for  instance,  the  case  of  a  rectangle). 

In  Euclidean  space,  X  =  M.d,  we  refer  to  a  bounded  measurable  function 
:  Md  — >  M>o  as  a  density  on  Mrf.  The  ( generalized )  area  and  the  centroid 
of  a  bounded  measurable  set  S  C  with  respect  to  <f,  denoted  respectively 
by  Afj)(S)  and  CM^(5),  are  given  by 

A ^(5)  =  ^  4>(q)dq,  CM ^(S)  =  A  ^  q<j>{q)dq. 

When  the  function  <fi  being  used  is  clear  from  the  context,  we  simply  refer 
to  the  area  and  the  centroid  of  S.  The  centroid  can  alternatively  be  defined 
as  follows.  Define  the  polar  moment  of  inertia  of  S  about  p  £  S  by 

J<t>(s,p)=  f  \\q  -  pWIHq)^- 

Js 

Then,  the  centroid  of  S  is  precisely  the  point  p  £  S  that  minimizes  the  polar 
moment  of  inertia  of  S  about  p.  This  can  be  easily  seen  from  the  Parallel 
Axis  Theorem  [Hibbeler,  2006],  that  states 

J 0(5, p)  =  J*(S,  CM*(S))  +  A0(5)||p  -  CM0(5)||i. 

Remark  2.3  (Computation  of  geometric  centers  in  the  plane).  The 

circumcenter,  incenter,  and  centroid  of  a  polygon  can  be  computed  in  several 
ways.  A  simple  procedure  to  compute  the  circumcenter  consists  of  enumer¬ 
ating  all  pairs  and  triplets  of  vertices  of  the  polygon,  compute  the  centers 
and  radiuses  of  the  balls  passing  through  them,  and  selecting  the  ball  with 
the  smallest  radius  that  encloses  the  polygon.  An  alternative,  more  efficient 
way  of  computing  the  circumcenter,  is  to  use  the  formulation  (2.1.1).  A 
convex  quadratically  constrained  linear  program  is  a  particular  case  of  a 
semidefinite-quadratic-linear  program  (SQLP).  Several  freely-available  nu¬ 
merical  packages  exist  to  solve  SQLP  problems,  e.g.,  SDPT3  [Tutuncu  et  ah, 
2003].  The  computation  of  the  incenter  set  of  a  polygon  can  be  performed 
via  linear  programming  using  the  formulation  (2.1.2).  Finally,  the  centroid 
of  a  polygon  can  be  computed  with  any  numerical  routine  that  accurately 
approximates  the  integral  of  a  function  over  a  planar  domain.  • 
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2.1.4  Voronoi  and  range-limited  Voronoi  partitions 

A  partition  of  a  set  5  is  a  subdivision  of  S  into  connected  subsets  that  are 
disjoint  except  for  their  boundary.  Formally,  a  partition  of  S'  is  a  collection 
of  closed  connected  sets  {W\, . . . ,  Wmj  C  P(S)  that  verify 

S  =  U™  l  Wi  and  int (Wj)  0  int(Wfc)  =  0, 

for  all  j,  k  G  {1, . . . ,  m}. 

Definition  2.4  (Voronoi  partition).  Given  a  distance  function  dist  : 
X  x  X  — >  M>o,  a  set  S  C  X  and  n  distinct  points  V  =  {p\, . . .  ,pn}  in  S, 
the  Voronoi  partition  of  S  generated  by  V  is  the  collection  of  sets  V(V)  = 
. . . ,  Vn(V)}  C  P(S)  defined  by,  for  each  i  G  {1, . . .  ,n}, 

ViiV)  =  {q  G  S  |  dist (q, pi)  <  dist (q,Pj),  for  all  pj  G  V}.  • 


In  other  words,  Vj('P)  is  the  set  of  the  points  of  S  that  are  closer  to  pi 
than  to  any  of  the  other  points  in  V.  We  will  refer  to  ViiV)  as  the  Voronoi 
cell  of  pi . 

Unless  explicitly  noted  otherwise,  we  compute  the  Voronoi  partition  ac¬ 
cording  to  the  following  conventions: 


•  For  X  =  M.d,  with  respect  to  the  Euclidean  distance; 

•  for  X  =  E>d,  with  respect  to  the  geodesic  distance;  and 

•  for  X  =  Rdl  x  §d2 ,  d\  +  d,2  =  d,  with  respect  to  the  Cartesian  product 
distance  determined  by  dist2  on  Mrfl  and  dist5  on  S^2. 


In  the  Euclidean  case,  the  Voronoi  cell  of  pi  is  equal  to  the  intersection  of 
half-spaces  determined  by  pi  and  the  other  locations  in  V,  and  as  such  it  is  a 
convex  polytope.  The  left  plot  in  Figure  2.5  shows  an  example  of  the  Voronoi 
partition  of  a  convex  polygon  generated  by  40  points.  Figure  2.6  shows  an 
example  of  the  Voronoi  partition  of  the  circle  generated  by  5  points. 

Definition  2.5  (r-limited  Voronoi  partition).  Given  a  distance  function 
dist  :  X  x  X  — >  M>o,  a  set  S  C  X,  n  distinct  points  V  =  {'P\ , . . .  ,pn}  in  S, 
and  a  positive  real  number  r  G  M>o,  the  r-limited  Voronoi  partition  inside 
S  generated  by  V  is  the  collection  of  sets  Vr(V)  =  {V\X{V), . . . ,  VHj r(V)}  C 
P (S)  defined  by 

Vi>r(V)  =  Vi(V)nB(pi,r),  i  G  {1, . . . ,  n}.  • 
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Figure  2.5  Voronoi  partition  of  a  convex  polygon  (left)  and  r-limited  Voronoi  partition 
inside  a  convex  polygon  (right)  generated  by  40  points.  The  colored  regions 
correspond  to  the  Voronoi  cells  and  the  r-limited  Voronoi  cells,  respectively,  of 
each  individual  point. 


Figure  2.6  Voronoi  partition  of  the  circle  generated  by  5  points.  The  dashed  segments 
correspond  to  the  Voronoi  cells  of  each  individual  point. 
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Note  that  the  r-limited  Voronoi  partition  inside  S  is  precisely  the  Voronoi 
partition  of  the  set  U 2=iB(pi,r)  0  S.  We  will  refer  to  V^r {V)  as  the  r- 
limited  Voronoi  cell  of  p±.  The  right  plot  in  Figure  2.5  shows  an  example 
of  the  r-limited  Voronoi  partition  inside  a  convex  polygon  generated  by  40 
points. 

Let  X  =  Mrf,  X  =  §>d  or  X  =  Mdl  x  §rf2,  d  =  d\  +  d%.  Given  a  density  <j) 
on  X,  a  set  of  n  distinct  points  V  =  {pi, . . .  ,pn}  in  S  C  X  is  a 

(i)  centroidal  Voronoi  configuration  if  each  point  is  the  centroid  of  its 
own  Voronoi  cell,  i.e.,  pi  = 

(ii)  r-limited  centroidal  Voronoi  configuration,  for  r  E  M>o,  if  each 
point  is  the  centroid  of  its  own  r-limited  Voronoi  cell,  that  is, 

Pi  =  CM^(Vj!?.('P)).  If  r  >  diam(S'),  then  an  r-limited  centroidal 
Voronoi  configuration  is  a  centroidal  Voronoi  configuration; 

(iii)  circumcenter  Voronoi  configuration  if  each  point  is  the  circumcenter 
of  its  own  Voronoi  cell,  i.e.,  pt  =  CC (Vi(V));  and 

(iv)  incenter  Voronoi  configuration  if  each  point  is  an  incenter  of  its  own 
Voronoi  cell,  i.e.,  pi  E  ICiVfiV)). 

Figure  2.7  presents  an  illustration  of  the  various  notions  of  center  Voronoi 
configurations. 


Figure  2.7  From  left  to  right,  centroidal,  r-limited  centroidal,  circumcenter,  and  incenter 
Voronoi  configurations  composed  by  16  points  in  a  convex  polygon.  Darker 
blue-colored  areas  correspond  to  higher  values  of  the  density  <j>. 


2.2  PROXIMITY  GRAPHS 

Roughly  speaking,  a  proximity  graph  is  a  graph  whose  vertex  set  is  a  set  of 
distinct  points  and  whose  edge  set  is  a  function  of  the  relative  locations  of 
the  point  set.  Proximity  graphs  appear  in  computational  geometry.  In  this 
section  we  study  this  important  notion  in  detail  following  the  presentation 

by  Cortes  et  al.  [2005]. 

Definition  2.6  (Proximity  graph).  Let  S  C  X,  where  X  is  a  d-dimensional 
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space  chosen  among  Md,  E>d ,  and  the  Cartesian  products  Mdl  x  §rf2,  for  some 
d\  +  d2  =  d.  Let  G(S)  be  the  set  of  all  undirected  graphs  whose  vertex  set 
is  an  element  of  ¥(S).  A  proximity  graph  Q  :  ¥ (5)  — >  G (S)  associates  to  a 
set  of  distinct  points  V  =  {p±, . . .  ,pn}  C  S  an  undirected  graph  with  vertex 
set  V  and  whose  edge  set  is  £g{V)  C  {(p,  q)  e  V  x  V  \  p  /  q}.  • 

Note  that  in  a  proximity  graph  a  point  cannot  be  its  own  neighbor.  From 
this  definition,  we  observe  that  the  distinguishing  feature  of  proximity  graphs 
is  that  their  edge  sets  change  with  the  location  of  their  vertices.  It  is  also 
possible  to  define  proximity  graphs  that  associate  to  each  point  set  a  digraph, 
but  we  will  not  consider  them  here. 

Examples  of  proximity  graphs  on  X,  where  recall  that  dist  =  dist2  if 
X  =  Rd,  dist  =  dist9  if  A  =  §>d,  and  dist  =  (dist2,  dists)  if  X  =  Rdl  x  §d2, 
include  the  following. 

(i)  The  complete  graph  Qc mpit  where  any  two  points  are  neighbors. 

When  convenient,  we  may  view  the  complete  graph  as  weighted 
by  assigning  the  weight  dist(pi,pj)  to  the  edge  (pupj)  G  £gcmplt  {V ) ; 

(ii)  the  r-disk  graph  f/diskMi  f°r  r  £  ^>o>  where  two  points  are  neigh¬ 
bors  if  they  are  located  within  a  distance  r,  that  is,  ( Pi,Pj )  G 
£Sdisk(r)CP)  if  dist (pi,pj)  <  r; 

(iii)  the  Delaunay  graph  where  two  points  are  neighbors  if  their 
corresponding  Voronoi  cells  intersect,  that  is,  ( Pi,Pj )  G  £gD(V)  if 

(iv)  the  r-limited  Delaunay  graph  I/ldIt);  f°r  r  ^  ^>o;  where  two  points 
are  neighbors  if  their  corresponding  ^-limited  Voronoi  cells  intersect, 
that  is,  (pi,pj)  G  £g^u{r){V)  if  V^ziV)  PI  Vjtz(V)  /  0;  and 

(v)  the  relative  neighborhood  graph  <7rn,  where  two  points  are  neighbors 
if  their  associated  open  lune  (cf.  Section  1.1.3)  does  not  contain  any 

point  in  V ,  that  is,  (pi,Pj)  G  £gRN{V)  if  pk  0  B  (p^ ,  dist  (pi  ,pj))C\B  (pj ,  dist  (pi  ,Pj)) 
for  all  pk  eV,k(£  {i,  j}; 

Figure  2.8  shows  examples  of  these  proximity  graphs  in  the  plane. 

Additional  examples  of  proximity  graphs  in  the  Euclidean  space  include 
the  following. 

(vi)  The  Gabriel  graph  Qq,  where  two  points  are  neighbors  if  the  ball 
centered  at  their  midpoint  and  passing  through  both  of  them  does 
not  contain  any  point  in  V,  that  is,  ( Pi,Pj )  G  £ga{V)  if  pk  0 

B(^±,  dist(pi,pj) )  for  allpkeV. 

(vii)  the  r -00 -disk  graph  C/oo-diskM,  for  r  G  M>o,  where  two  points  are 
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/  / 

HI: 

(a)  ihrri  pi  t  (b)  fAlisk  (?')  (c)  Q I  j 


(d)  Ghl)(r)  (e)  C/rn 

Figure  2.8  Proximity  graphs  in  R2.  From  left  to  right,  in  the  first  row,  complete,  r- 
disk,  and  Delaunay,  and  in  the  second  row,  r-limited  Delaunay  and  relative 
neighborhood  for  a  set  of  15  points.  When  appropriate,  the  geometric  objects 
determining  the  edge  relationship  are  plotted  in  lighter  gray. 


neighbors  if  they  are  located  within  a  L°°-distance  r,  that  is,  ( Pi,Pj )  G 
^oo-diSk(r)(^)  if  distoo (pi,pj)  <  r; 

(viii)  the  Euclidean  minimum  spanning  tree  <7emst,5  °f  a  proximity  graph 
G,  that  assigns  to  each  V  a  minimum- weight  spanning  tree  (cf. 
Section  1.4. 4. 4)  of  G(V)  with  weighted  adjacency  matrix  = 

]| Pi  —  Pj\ 1 2 ,  for  ( Pi,Pj )  E  £g(V).  If  G(V)  is  not  connected,  then 
I?emst,c?(D  is  the  union  of  Euclidean  minimum  spanning  trees  of 
its  connected  components.  When  G  is  the  complete  graph,  we  sim¬ 
ply  denote  the  Euclidean  minimum  spanning  tree  by  Hemst  j 

(ix)  the  visibility  graph  Gvis,Q  in  an  allowable  environment  Q  in  M2, 
where  two  points  are  neighbors  if  they  are  visible  to  each  other, 
that  is,  ( PiiPj )  E  ^Gvis,Q('P)  if  the  closed  segment  \pi,Pj\  from  pi  to 
Pj  is  contained  in  Q ;  and 

(x)  the  range-limited  visibility  graph  f/vis-disk,Q  in  an  allowable  environ¬ 
ment  Q  in  M2,  where  two  points  are  neighbors  if  they  are  visible  to 
each  other  and  their  distance  is  no  more  than  r,  that  is,  ( Pi,Pj )  G 
&?vis-dis kq(V)  if  and  only  if  ( Pi,Pj )  G  Sgvis.Q(V)  and  (pi,Pj)  G  £gdiek(r)(' P) 

Figure  2.9  shows  examples  of  these  proximity  graphs  in  the  plane. 
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(d)  {?vis,Q  (®)  ^vis-disk, Q 

Figure  2.9  Proximity  graphs  in  R2.  From  left  to  right,  in  the  first  row,  Gabriel,  r-oo- 
disk,  and  Euclidean  minimum  spanning  tree  graphs  for  a  set  of  15  points,  and 
in  the  second  row,  visibility  and  range-limited  visibility  graphs  for  a  set  of  8 
agents  in  an  allowable  environment.  When  appropriate,  the  geometric  objects 
determining  the  edge  relationship  are  plotted  in  lighter  gray. 
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Figure  2.10  shows  some  examples  of  proximity  graphs  in  three-dimensional 
space. 


Figure  2.10  Proximity  graphs  in  R3.  From  left  to  right,  r-disk,  relative  neighborhood, 
and  Gabriel  graphs  for  a  set  of  25  points. 


As  for  standard  graphs,  let  us  alternatively  describe  the  edge  set  by  means 
of  the  sets  of  neighbors  of  the  individual  graph  vertices.  To  each  proximity 
graph  Q,  each  pel  and  each  V  =  {p\ , . . . ,  pn}  E  F(X),  we  associate  the 
set  of  neighbors  map  Mg  :  X  x  Fpf)  —  F(V)  defined  by 

Ng{p,V)  =  {q  €V  |  {p,q)  E  £g(VU{p})}. 

Typically,  p  is  a  point  in  V,  but  the  definition  is  well-posed  for  any  p  E 
X.  Under  the  assumption  that  V  does  not  contain  repeated  elements,  the 
definition  will  not  lead  to  counterintuitive  interpretations  later.  Given  p  E 
X,  it  is  convenient  to  define  the  map  MgtP  :  F(X)  — >  F(X)  by  Mg}P(V)  = 
Mg{p,V). 

A  proximity  graph  Gi  is  a  subgraph  of  a  proximity  graph  Q 2 ,  denoted  Q\  C 
Q2 ,  if  Gi(V)  is  a  subgraph  of  G2(V)  for  all  V  E  F(A).  The  following  result, 
whose  proof  is  given  in  Section  2.5.1,  summarizes  the  subgraph  relationships 
in  the  Euclidean  case  among  the  various  proximity  graphs  introduced  above. 

Theorem  2.7  (Subgraph  relationships  among  some  standard  prox¬ 
imity  graphs  on  Wl).  For  r  E  M>o,  the  following  statements  hold. 

(i)  <?EMST  c  <5rn  cfcc  G d;  and 

(ii)  Gg  G  f?disk(r )  c  <?Ld(0  c  ^DG^diskW- 

Remark  2.8  (The  r-limited  Delaunay  graph  is  not  equal  to  the 
intersection  of  the  Delaunay  graph  and  the  r-disk  graph).  The  in¬ 
clusion  Q ld(?")  C  H  f?disk(r)  is  in  general  strict;  this  counterintuitive  fact 
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is  discussed  in  Exercise  E2.2.  • 

Note  that,  since  Uemst  is  by  definition  connected,  Theorem  2.7(i)  implies 
that  Urn,  Ug  and  Ud  are  connected,  the  connectivity  properties  of  Udisk(?”) 
are  characterized  in  the  following  result. 

Theorem  2.9  (Connectivity  properties  of  some  standard  proximity 
graphs  on  Rd).  For  r  E  M>o,  the  following  statements  hold. 

(i)  Uemst  C  UdiskM  if  and  only  if  Gdisk(r)  is  connected;  and 

(a)  Uemst  O  Udisk  (?" ) ;  Urn  O  UdiskM,  Ug  H  Udisk(r)  and  Ggg{t)  have  the 
same  connected  components  as  Udisk(0  (i.e.,  for  all  point  sets  V  E 
F(Mrf),  all  graphs  have  the  same  number  of  connected  components 
consisting  of  the  same  vertices). 

The  proof  of  this  theorem  is  given  in  Section  2.5.1.  Note  that  in  Theo¬ 
rem  2.9,  fact  (ii)  implies  (i). 


2.2.1  Spatially  distributed  proximity  graphs 

We  now  consider  the  following  loosely-stated  question:  when  a  does  a  given 
proximity  graph  encode  sufficient  information  to  compute  another  proximity 
graph.  For  instance,  if  a  node  knows  the  position  of  its  neighbors  in  the  com¬ 
plete  graph  (that  is,  of  every  other  node  in  the  graph),  then  it  is  clear  that 
the  node  can  compute  its  neighbors  with  respect  to  any  proximity  graph. 
Let  us  formalize  this  idea.  A  proximity  graph  G i  is  spatially  distributed  over 
a  proximity  graph  G 2  if,  for  all  p  E  V, 

NguP(V)  =  NguP{Ng2tP{V)) , 

that  is,  any  node  informed  about  the  location  of  its  neighbors  with  respect 
to  G2  can  compute  its  set  of  neighbors  with  respect  to  G 1  • 

Clearly,  any  proximity  graph  is  spatially  distributed  over  the  complete 
graph.  It  is  straightforward  to  deduce  that  if  G 1  is  spatially  distributed  over 
G 2,  then  G 1  is  a  subgraph  of  G 2-  The  converse  is  in  general  not  true.  For 
instance,  Gg  H  UdiskM  is  a  subgraph  of  Udisk(0>  but  Gg  H  UdiskM  is  not 
spatially  distributed  over  UdiskM,  see  Exercise  E2.3. 

The  following  result  identifies  proximity  graphs  which  are  spatially  dis¬ 
tributed  over  <5disk(r)- 

Proposition  2.10  (Spatially  distributed  graphs  over  the  r-disk  graph) 

The  proximity  graphs  Urn  H  Udisk(0>  Ug  C  Udisk(r)>  and  Uld(t)  are  spatially 
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distributed  over  t/diskM- 

Remark  2.11  (Computation  of  the  Delaunay  graph  over  the  r-disk 
graph).  In  general,  for  a  fixed  r  £  M>o,  Gd  is  not  spatially  distributed  over 
£diskM-  However,  for  a  given  V  £  F(X),  it  is  always  possible  find  r  such 
that  Gd(P)  is  spatially  distributed  over  Gdisk(r)(V).  This  is  a  consequence 
of  the  following  observations.  Given  V  £  F(V),  define  the  convex  sets 

W(pi,r)  =  B(pur)  O  (nn^-pjH  <rHP(tPj),  i  £  {l,...,n}, 

where  recall  that  Hpx  is  the  half-space  of  points  q  in  M.d  with  the  property 
that  ||g  —  p\\z  <  ||g  — x||2-  Note  that  the  intersection  B(pi,r)r\  Vi  is  a  subset 
of  W(pi,r).  Provided  r  is  twice  as  large  as  the  maximum  distance  between 
Pi  and  the  vertices  of  W  (pt ,  r) ,  then  all  Delaunay  neighbors  of  pi  are  within 
distance  r  from  pt.  Equivalently,  the  half-space  Hpi>p  determined  by  pi  and  a 
point  p  outside  B(pi,r)  does  not  intersect  W(pi,r).  Therefore,  the  equality 
Vi  =  W ( pi ,  r)  holds.  For  node  i  £  {1, . . . ,  n},  the  minimum  adequate  radius 
is  then 


in  =  2  max{||pj  -  q\\2  \  q  G  W(pi,ritmin)}. 

The  minimum  adequate  radius  across  the  overall  network  is  then  rmin  = 
maxjgjx  rj!min.  The  algorithm  presented  in  [Cortes  et  al.,  2004]  builds 
on  these  observations  to  compute  the  Voronoi  partition  of  a  bounded  set 
generated  by  a  pointset  in  a  distributed  way.  • 


2.2.2  The  locally-cliqueless  graph  of  a  proximity  graph 

Given  a  proximity  graph,  it  is  sometimes  useful  to  construct  another  prox¬ 
imity  graph  that  has  fewer  edges  and  the  same  number  of  connected  compo¬ 
nents.  This  is  certainly  the  case  when  optimizing  multi-agent  cost  functions 
in  which  the  proximity  graph  edges  describe  pairwise  constraints  between 
agents.  Additionally,  the  construction  of  the  new  proximity  graph  should 
be  spatially  distributed  over  the  original  proximity  graph.  Here,  we  present 
the  notion  of  locally-cliqueless  graph  of  a  proximity  graph. 

Let  G  be  a  proximity  graph  in  the  Euclidean  space.  The  locally-cliqueless 
graph  Gic,g  of  Q  is  the  proximity  graph  defined  by:  ( Pi,Pj )  £  Eglcg(V)  if 
(. Pi,Pj )  ^£g{P)  and 

(. PhPj )  e  Eg  EMST  (V'), 

for  any  maximal  clique  V ’  of  ( Pi,Pj )  in  G ■  Figure  2.11  shows  an  illustration 
of  this  notion.  The  properties  of  this  construction  are  summarized  in  the 
following  result,  see  [Ganguli  et  al.,  2007b]  for  the  proof. 
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Figure  2.11  Locally-cliqueless  graph  Qic,gvis  Q  of  the  visibility  graph  C/vis, q  for  the  node 
configuration  shown  in  Figure  2.9(d). 


Theorem  2.12  (Properties  of  the  locally-cliqueless  graph).  Let  Q  be 

a  proximity  graph  in  the  Euclidean  space.  Then,  the  following  statements 
hold. 

(i)  (?EMST,g  Q  G\c,g  Q  G; 

(ii)  g  has  the  same  connected  components  as  Q;  and 

(Hi)  for  Q  =  Gdisk(r),  Gvis,Q,  and  £vis_disk,Q ,  where  r  €  M>0  and  Q  is  an 
allowable  environment,  G\c,g  is  spatially  distributed  over  Q . 

In  general,  the  inclusions  in  Theorem  2. 12(i)  are  strict. 


2.2.3  Proximity  graphs  over  tuples  of  points 

The  notion  of  proximity  graph  is  defined  for  sets  of  distinct  points  V  = 
{pi, . . . ,  Pn} ■  However,  we  will  be  interested  in  considering  tuples  of  elements 
of  X  of  the  form  P  =  (p\, . . .  ,pn),  where  pi  corresponds  to  the  position  of 
an  agent  i  of  a  robotic  network.  In  principle,  note  that  the  tuple  P  might 
contain  coincident  points.  In  order  to  reconcile  this  mismatch  between  sets 
and  tuples,  we  will  do  the  following. 

Let  if  :  Xn  —>  F(X)  be  the  natural  immersion  of  Xn  into  F(X),  that 
is,  if(P)  is  the  point  set  that  contains  only  the  distinct  points  in  P  = 
(pi, . . . , pn ) •  Note  that  ip  is  invariant  under  permutations  of  its  arguments 
and  that  the  cardinality  of  if{pi,  ■ . .  ,pn)  is  in  general  less  than  or  equal  to 
n.  In  what  follows,  V  =  iw(P)  will  always  denote  the  point  set  associated 
to  P  G  Xn.  Using  the  natural  inclusion,  the  notion  of  proximity  graphs  can 
be  naturally  extended  as  follows:  given  Q ,  we  define  (with  a  slight  abuse  of 
notation) 

Q  =  Q  o  i¥  :  Xn  -*•  G(X). 
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Additionally,  we  define  the  set  of  neighbors  map  Mg  :  X  x  Xn  — »  F(A)  by 


Ng(p,  (pi,  •  ■  •  ,pn))  =  Mg(p,i¥{pi, . .  .  ,pn)). 

Note  that,  according  to  this  definition,  coincident  points  in  the  tuple  (pi , . . .  ,pn) 
will  have  the  same  set  of  neighbors.  As  before,  it  is  convenient  to  define  the 
shorthand  notation  MgiP  :  Xn  — >  F(A),  MgiP{P )  =  Mg(p,  P )  for  p  £  X. 


2.2.4  Spatially  distributed  maps 

Given  a  set  Y  and  a  proximity  graph  Q,  a  map  T  :  Xn  — >  is  spatially 
distributed  over  Q  if  there  exists  a  map  T  :  A  xF(X)  — ►  Y,  with  the  property 
that,  for  all  {pi, . . .  ,pn)  G  Xn  and  for  all  j  G  {1, . . . ,  n}, 

Tj(pi,  •  •  .,Pn)  =  T{pj,Mg>Pj{pi,  .  .  .  ,Pn )), 

where  T?  denotes  the  jth-component  of  T.  In  other  words,  the  jth  com¬ 
ponent  of  a  spatially  distributed  map  at  (p\ , . . .  ,pn)  can  be  computed  with 
only  the  knowledge  of  the  vertex  pj  and  the  neighboring  vertices  in  the 
undirected  graph  Q{P). 

When  studying  coordination  tasks  and  coordination  algorithms,  it  will  be 
relevant  to  characterize  the  spatially  distributed  features  of  functions,  vector 
fields  and  set- valued  maps  with  respect  to  suitable  proximity  graphs. 

Remark  2.13  (Relationship  with  the  notion  of  spatially  distributed 
graphs).  Note  that  the  proximity  graph  Q\  is  spatially  distributed  over  the 
proximity  graph  Q2  if  and  only  if  the  map 

P  €  Xn  ~  (MguPl(P), .  . .  ,MguPn(P ))  G  F(A)n 

is  spatially  distributed  over  t/2-  • 


2.3  GEOMETRIC  OPTIMIZATION  PROBLEMS  AND 
MULTICENTER  FUNCTIONS 

In  this  section  we  consider  various  interesting  geometric  optimization  prob¬ 
lems.  By  geometric  optimization,  we  mean  an  optimization  problem  induced 
by  a  collection  of  geometric  objects,  see  [Boltyanski  et  al.,  1999].  We  shall 
pay  particular  attention  to  facility  location  problems,  in  which  service  sites 
are  spatially  allocated  to  fulfill  a  particular  request. 
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2.3.1  Expected-value  multicenter  functions 

Let  S  C  M.d  be  a  bounded  environment  of  interest,  and  consider  a  density 
function  cj)  :  Rrf  — »  R>o-  For  the  discussion  of  this  section,  only  the  value  of 
4>  restricted  to  S  is  of  interest.  One  can  regard  4>  as  a  function  measuring  the 
probability  that  some  event  takes  place  over  the  environment.  The  larger 
the  value  of  the  more  important  the  location  q  is.  We  refer  to  a  non¬ 
increasing  and  piecewise  continuously  differentiable  function  /  :  R>o  — >  R, 
possibly  with  finite  jump  discontinuities,  as  a  performance.  Performance 
functions  describe  the  utility  of  placing  a  node  at  a  certain  distance  from  a 
location  in  the  environment.  The  smaller  the  distance,  the  larger  the  value 
of  /,  i.e. ,  the  better  the  performance.  For  instance,  in  servicing  problems, 
performance  functions  can  encode  the  travel  time  or  the  energy  expenditure 
required  to  service  a  specific  destination.  In  sensing  problems,  performance 
functions  can  encode  the  signal-to-noise-ratio  between  a  source  with  un¬ 
known  location  and  a  sensor  attempting  to  locate  it. 

Given  a  bounded  measurable  set  S  C  Rrf,  a  density  function  (j>,  and  a 
performance  function  /,  let  us  consider  the  expected  value  of  the  coverage 
over  any  point  in  S  provided  by  a  set  of  points  pi, ...  ,pn.  Formally,  we 
define  the  expected-value  multicenter  function  7iex p  :  Sn  — *  R  by 

FfexpOl,- ■  ■  ,Pn)  =  /  max  f(\\q-pi\\2)4>(q)dq.  (2.3.1) 

JSie{l,...,n} 

The  definition  of  7iexp  can  be  read  as  follows:  for  each  location  q  £  S, 
consider  the  best  coverage  of  q  among  those  provided  by  each  of  the  nodes 
pi, . . .  ,pn,  which  corresponds  to  the  value  maxje/lj  f(\\q  —  1 1 2 ) -  Then, 

weigh  the  performance  by  the  importance  (j>(q )  of  the  location  q.  Finally, 
sum  the  resulting  quantity  over  all  the  locations  of  the  environment  S  to 
obtain  7iexp(pi,  ■  ■  ■  ,pn)  as  a  measure  of  the  overall  coverage  provided  by 
Pi,---  ,Pn- 

Given  the  meaning  of  7iex p,  we  seek  to  solve  the  following  geometric  op¬ 
timization  problem: 


maximize  Hexp(pi,  ■  ■  ■ ,Pn ),  (2.3.2) 

that  is,  we  seek  to  determine  a  set  of  configurations  pi,. . .  ,pn  that  maximize 
the  value  of  the  multicenter  function  Ttexp.  An  equivalent  formulation  of  this 
problem  is  referred  to  as  a  continuous  p-median  problem  in  the  literature 
on  facility  location,  e.g.,  see  [Drezner,  1995].  In  our  discussion,  we  will 
pay  special  attention  to  the  case  when  n  =  1,  that  we  term  the  1 -center 
problem.  For  the  purpose  of  solving  (2.3.2),  note  that  we  can  assume  that 
the  performance  function  satisfies  /( 0)  =  0.  This  can  be  done  without  loss 
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of  generality,  since  for  any  c  E  R,  one  has 

/  max  (f{\\q-pih)+c)(t>(q)dq  =  Hexp(p1,...,pn)+cArp(S). 

The  expected- value  multicenter  function  can  be  alternatively  described  in 
terms  of  the  Voronoi  partition  of  S  generated  by  V  =  {pi, . . .  ,pn}-  Let  us 
define  the  set 

Scoinc  =  {(pi,  •  ■  ■  ,Pn)  G  (Kd)ri  I  Pi  =  Pj  for  some  i  /  j}, 

consisting  of  tuples  of  n  points  where  some  of  them  are  repeated.  Then,  for 
{pi, . . .  ,pn)  E  Sn  \  Sc oinc,  one  has 

n  r, 

'Hexp(pi,...,Pn)  =  y2  /(Ik  -  Pih)4>(q)dq.  (2.3.3) 

Mv) 

This  expression  of  7iexp  is  appealing  because  it  clearly  shows  the  result  of 
the  overall  coverage  of  the  environment  as  the  aggregate  contribution  of  all 
individual  nodes.  If  (p\, . . .  ,pn)  S  5coinc>  then  a  similar  decomposition  of 
77exp  can  be  written  in  terms  of  the  distinct  points  V  =  iw(pi,  ■  ■  ■  ,Pn)- 

Inspired  by  the  expression  (2.3.3),  let  us  define  a  more  general  version 
of  the  expected- value  multicenter  function.  Given  (p\ , . . .  ,pn)  6  Sn  and  a 
partition  {W\, . . . ,  Wnj  C  P(5)  of  S,  let 

n  r, 

Hexp{jpi,...,pn,W1,...,Wn)  =  V  /  f(\\q-Pih)<Kq)dq.  (2.3.4) 

i=i  Jw* 

Notice  that  Hexp(pi,  ■  ■  ■  ,pn)  =  Lfexpbi,  •  •  ■  ,Pn,  Vi(V), . . . ,  Vn(V)),  for  all 
(pi, ...  ,pn)  E  Sn  \  5coinc-  Moreover,  one  can  establish  the  following  opti¬ 
mality  result,  see  [Du  et  ah,  1999]. 

Proposition  2.14  (77exp-optimality  of  the  Voronoi  partition).  Let 

V  =  {pi, . . .  ,pn}  E  F(S).  For  any  performance  function  f  and  for  any 
partition  {W\ , . . . ,  Wn}  C  P(S')  of  S, 

Hexp(Pl,-  ■  ■  ,Pn,Vi(V),  .  .  •  ,  Vn(v))  >  Hexp(pi,..  .  ,Pn,Wi,  .  .  .  ,Wn), 

that  is,  the  Voronoi  partition  V(V)  is  optimal  forHexp  among  all  partitions 
ofS. 


Proof.  Assume  that,  for  some  i  j  E  {1 , ...  ,n},  the  set  int(Wj)  flint(V)) 
has  strictly  positive  measure.  For  all  q  E  int(Wj)  nint(V)),  we  know  ||g  — 
Pih  >  \\q~Pjh-  Because  /  is  non-increasing,  f(\\q  -  Pih)  <  f(\\q~Pjh) 
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and,  since  int(Wj)  n  int(V})  has  strictly  positive  measure, 


J  int(Wj)  PI  int(V^) 
Therefore,  we  deduce 


L 


f{h~Pih)^{q)dq  < 


L 


int(Wi)  nint(V}) 


f(h  -  Pjh)Hq)dq- 


n 


f(\\q  -  Pjh)(j>(q)dq, 


Wi  n  Vj 


and  the  statement  follows. 


Different  choices  of  performance  function  give  rise  to  different  expected- 
value  multicenter  functions  with  particular  features.  Let  us  examine  some 
important  cases. 

Distortion  problem:  Consider  as  performance  function  f(x)  =  —  x2 .  Then, 
on  S  \  5Coino  the  expected- value  multicenter  function  takes  the  form 


where  recall  that  J^H7,  p)  denotes  the  polar  moment  of  inertia  of  the 
set  W  about  the  point  p.  In  signal  compression  — Thdistor  is  referred  to 
as  the  distortion  function  and  is  relevant  in  many  disciplines  including 
vector  quantization,  signal  compression,  and  numerical  integration, 
see  [Gray  and  Neuhoff,  1998,  Du  et  ah,  1999].  Here,  distortion  refers 
to  the  average  deformation  (weighted  by  the  density  f> )  caused  by 
reproducing  q  E  S  with  the  location  pi  in  V  =  {p\ , . . .  ,pn}  such  that 
q  G  Vi(V).  It  is  interesting  to  note  that 


n 


^distor  (Pi  ?  •  •  •  iPni  Wli  •  •  •  ?  ^  ^ 


n 


n 


=  ~Y  CM^Wi))  -  Y,  A0(W*)bz  -  CM*(Wi)||i,  (2.3.5) 


i=l 


where  in  the  last  equality  we  have  used  the  Parallel  Axis  Theorem  [Hibbeler, 
2006].  Note  that  the  first  term  only  depends  on  the  partition  of  S, 
whereas  the  second  term  also  depends  on  the  location  of  the  points. 

The  following  result  is  a  consequence  of  this  observation. 

Proposition  2.15  (77distor-0ptiniality  °f  centroid  locations).  Let 

{W\, . . . ,  Wn}  C  P(S)  be  a  partition  of  S.  Then,  for  anyV  =  {pi, . . .  ,pn}  G 
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F  (S), 


Hdistor  (  CM^(tRi),  .  .  .  ,  CM^Wn),  WU  .  .  .  ,  Wn) 

fl  ”^distor(Pli  •  ■  •  iPrn  Wli  ■  ■  ■  j  Wn)i 

that  is,  the  centroid  locations  CM^Wi), . . . ,  CM^Wn)  are  optimal 
for  Hdistor  among  all  configurations  in  S.  Moreover,  if  all  the  sets 
{W\, . . . ,  Wn\  have  non-vanishing  areas,  the  inequality  is  strict  unless 
Pi  =  CM ${Wi)  for  all  i  G  {1, . .  .,n}. 

A  consequence  of  this  result  is  that  for  the  1-center  problem,  i.e. ,  when 
n  =  1,  the  node  location  that  optimizes  p  i— >  'Hdistor(p)  =  —  J<f>(S,p)  is 
the  centroid  of  the  set  S,  denoted  by  CM^(S’); 

Area  problem:  Consider  as  performance  function  f(x)  =  l[o,a]  (x)>  a  £ 
M>o  the  indicator  function  of  the  closed  interval  [0, a].  Then,  the 
expected-value  multicenter  function  takes  the  form 


n  „ 

Harea,a(Pl,  •  ■  •  ,Pn)  =  V  /  l[0,a]  ( II Q  ~  Pi  h)4>{q)dq 

7^1  Jvdv) 


E  / 

i=1  dV.iPin  B(pt,a) 


4>{q)dq 


=  J2  A 4>{Vi{V)  n  B(Pi,a))  =  A0(U 7=1B(Pi,  a)), 
2=1 


that  is,  it  corresponds  to  the  area,  measured  according  to  cj) ,  covered  by 
the  union  of  the  n  balls  B(p\,a), . . .  ,B(pn,  a).  Exercise  E2.4  discusses 
the  1-center  area  problem; 


Mixed  distortion-area  problem:  Consider  as  performance  function  f{x)  = 
— x 2  l[o,a](a:)  +  b  ■  l^+ooK®),  with  a  G  M>o  and  b  <  —a2.  Then,  on 
S  \  <Scoinc,  the  expected- value  multicenter  function  takes  the  form 

n 

^distor-area  ,a,b  (.Pi  >  ■  ■  •  j  Pn )  —  ^  ]  J )iPi)  T  b  A ^(Q  \  U i=\B(jpi,  a)), 

i= 1 

that  is,  it  is  a  combination  of  the  multicenter  functions  corresponding 
to  the  distortion  problem  and  the  area  problem.  Of  special  interest  to 
us  is  the  multicenter  function  that  results  from  the  choice  b  =  —a2. 

In  this  case,  the  performance  function  /  is  continuous,  and  we  simply 
write  77distor-area,a-  The  extension  of  this  function  to  sets  of  points  and 
partitions  of  the  space  reads  as 

77distor-area,a  (jPl,...,Pn,Wl,...,Wn) 
n 

=  ^  E  (  J^Wi  n  «)>Pi)  +  «2  MWi  n  (S  \  B(3H,  a))))  • 

i=l 


114 


“Distributed  Control  of  Robotic  Networks”  by  F.  Bullo,  J.  Cortes  and  S.  Martinez 
Copyright  ©  2006-2008.  Manuscript  under  contract.  This  version:  October  27,  2008 


DCRN  October  27,  2008 


We  leave  the  proof  of  the  following  optimality  result  as  a  guided  exer¬ 
cise  to  the  reader  (see  Exercise  E2.7). 

Proposition  2.16  (77distor-area,a-°ptimality  of  centroid  locations). 

Let  {Wi, . . .  ,Wn}  C  P(S')  be  a  partition  of  S.  Then,  for  any  V  = 
{pi,...,Pn}  e  F (S), 

^distor-area  ,a  ( CM^(^in5(pi,  a)), . .  • ,  CM^{WnnB{pn,  a)),  Wu...,Wn 

—  ^"fdistor  (pi  j  •  •  ■  j  Pm  W± ,  •  ■  ■  ,  Wn). 

Moreover,  if  all  the  sets  {W\, . . . ,  Wn\  have  non-vanishing  areas,  the 
inequality  is  strict  unless  pi  =  CM^  (War'll?  (p*,  a)),  for  alii  £  {1 ,n}. 

A  consequence  of  this  result  is  that  for  the  1-center  problem,  i.e.,  when 
n  =  1,  the  node  location  that  optimizes  p  i— >  ?7distor-area,a (p)  =  J <j>(S  H 
B(p,  a),p )  +  a2  A(j)(S  \  B(p,  a))  is  the  centroid  of  the  set  S  H  B(p,  a), 
denoted  by  CM^(5  n  B(p,  a)). 


Next,  we  characterize  the  smoothness  of  the  expected-value  multicenter 
function.  Before  stating  the  precise  result,  let  us  introduce  some  useful 
notation.  For  a  performance  function  /,  let  Dscn (/)  denote  the  (finite)  set 
of  points  where  /  is  discontinuous.  For  each  a  £  Dscn(/),  define  the  limiting 
values  from  the  left  and  from  the  right,  respectively,  as 

f-(a)=  lim  f(x),  /+(o)  =  lim  f(x). 

x — >a~  x—>a+ 

We  are  now  ready  to  characterize  the  smoothness  of  Ttex p,  whose  proof  is 
given  in  Section  2.5.3.  Before  stating  the  result,  recall  that  the  line  integral 
of  a  function  g  :  R2  — >  R  over  a  curve  C  parameterized  by  a  continuous  and 
piecewise  continuously  differentiable  map  7  :  [0, 1]  — >  M2  is  defined  by 

f  9=1  9(n)dnr-  [  9h(t))\\'y(t)hdt, 

Jc  Jc  Jo 

and  is  independent  of  the  selected  parameterization. 


Theorem  2.17  (Smoothness  properties  of  7iexp).  Given  S  C  bounded 
and  measurable,  a  density  :  R  — >  M>o  and  a  performance  function  f  : 
M>o  — ►  M,  the  expected-value  multicenter  function  Hexp  :  Sn  — >  R  is 

(i)  globally  Lipschitz  on  Sn;  and 


(ii)  continuously  differentiable  on  Sn\Sco-mc,  where  for  each  i  £  {1, . . . ,  n} 


=  /  jr-f(\\Q-Pih)<KQ)dq 

VPi  JVi{V)  OPi 


(2.3.6) 


+  (/-(«)  -  /+(°)) 

aGDscn(/) 


_  ^out  ,B(pi,a) 


!Vi(V)ndB(Pi,a) 


\(q)<t>{q)dq- 
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Therefore,  the  gradient  of  77ex p,  interpreted  as  a  map  from  Sn  to  Mn;  is 
spatially  distributed  (in  the  sense  defined  in  Section  2.2.4)  over  the  Delaunay 
graph  Qu- 

Let  us  discuss  how  Theorem  2.17  particularizes  to  the  distortion,  area, 
and  mixed  distortion-area  problems. 


Distortion  problem.  In  this  case,  the  performance  function  does  not  have 
any  discontinuities,  and  therefore,  the  second  term  in  (2.3.6)  vanishes. 
The  gradient  of  Tfdistor  on  Sn  \  Sco inc  then  takes  the  form,  for  each 
i  €  {1, . .  -,n}, 

=  2A0(Vi(7>))(CM  ^Vi(V))-pi), 

that  is,  the  ith  component  of  the  gradient  points  in  the  direction  of 
the  vector  going  from  pi  to  the  centroid  of  its  Voronoi  cell.  The  critical 
points  of  Tfdistor  are  therefore  the  set  of  centroidal  Voronoi  configura¬ 
tions  in  S  (cf.  Section  2.1.4).  This  is  a  natural  generalization  of  the 
result  for  the  1-center  case,  where  the  optimal  node  location  is  the 
centroid  CM^S1); 


Area  problem.  In  this  case,  the  performance  function  is  differentiable  ev¬ 
erywhere  except  at  a  single  discontinuity,  and  its  derivative  is  identi¬ 
cally  zero.  Therefore,  the  first  term  in  (2.3.6)  vanishes.  The  gradient 
of  77 area, a  on  Sn  \  5COmc  then  takes  the  form,  for  each  i  £  {1 , ,n}, 


dm 


dpi 


'<P)  =  I 


Vi(V)r\dB(pi,a) 


nout 


that  is,  the  zth  component  of  the  gradient  is  an  average  of  the  normal  at 
each  point  of  Vi(V)  n  dB(pi,  a),  see  Figure  2.12  for  an  illustration.  The 
critical  points  of  77area,a  correspond  to  configurations  with  the  property 
that  each  pi  is  a  local  maximum  for  the  area  of  Via(P)  =  V)(.P)  n 
B(pi,a )  at  fixed  Vi(P).  We  refer  to  these  configurations  as  a-limited 
area-centered  Voronoi  configurations.  This  is  a  natural  generalization 
of  the  result  for  the  1-center  case,  where  the  optimal  node  location 
maximizes  A^(5  n  B(p,a ))  (cf.  Exercise  E2.4); 


Mixed  distortion-area  problem.  In  this  case,  the  gradient  of  77distor-area,a,& 
is  a  combination  of  the  gradients  of  Tfdistor  and  77area,a-  Specifically, 
one  has  for  each  i  e  { 1 , . . . ,  n} , 

g77distor-area,a,b(p)  =  2  A^V^V))  (CM^V^V))  -  Pl) 

Opi 

~(a2  +  b)  _  nout5(  )(g)0(g)dg. 

JVi(V)ndB(Pi,a) 
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Figure  2.12  Gradient  of  the  area  function  in  an  environment  with  constant  density  func¬ 
tion.  The  component  of  the  gradient  corresponding  to  the  rightmost  node  is 
zero  -  roughly  speaking,  because  there  is  no  incentive  for  this  node  to  move  in 
any  particular  direction.  However,  the  component  of  the  gradient  for  each  of 
the  three  leftmost  nodes  is  non-zero  -  roughly  speaking,  because  by  moving 
in  the  direction  of  the  gradient,  these  agents  decrease  the  overlapping  among 
the  disk  and  cover  new  regions  of  the  space. 


For  the  particular  case  when  b  =  —a2,  the  performance  function  is 
continuous,  and  the  gradient  of  Hdistor-area,a  takes  the  simpler  form 


flTtdi.slor-a 


dpi 


( P )  =  2A*(Fiia(7>))(CM  ^Vi^VYi-pi), 


which  points  in  the  direction  of  the  vector  from  pi  to  the  centroid  of  its 
a-limited  Voronoi  cell.  In  this  case,  the  critical  points  of  Hdistor-area,a 
are  therefore  the  set  of  a-limited  centroidal  Voronoi  configurations  in 
S  (cf.  Section  2.1.4).  This  is  a  natural  generalization  of  the  result 
for  the  1-center  case,  where  the  optimal  node  location  is  the  centroid 
CM^SnBip,  a)). 


We  refer  to  Hdistor,  7farea ,a,  and  Wdistor-area.a  as  multicenter  functions  be¬ 
cause,  as  the  above  discussion  shows,  their  critical  points  correspond  to 
various  notions  of  center  Voronoi  configurations. 

It  is  important  to  note  that  the  gradients  of  77area,a  and  7fdiStor-area,a,b 
are  spatially  distributed  over  the  2a-limited  Delaunay  graph  Qu^{2a).  This 
observation  is  important  for  practical  considerations:  robotic  agents  with 
range-limited  interactions  cannot  in  general  compute  the  gradient  of  Tfdistor 
because,  as  we  noted  in  Remark  2.11,  for  a  given  r  G  M>o,  I/d  is  not  in  gen¬ 
eral  spatially  distributed  over  t/diskM-  However,  robotic  agents  with  range- 
limited  interactions  can  compute  the  gradients  of  ?7area,a  and  7fdistor-area,a,& 
as  long  as  r  >  2a  because,  from  Theorem  2.7(iii),  I?ld(t)  is  spatially  dis¬ 
tributed  over  f?disk(r)-  The  relevance  of  this  fact  is  further  justified  by  the 
following  result. 

Proposition  2.18  (Constant-factor  approximation  of  Tfdistor)*  Let 
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S  C  Rd  be  bounded  and  measurable.  Consider  the  mixed  distortion- area 
problem  with  a  £  ]0,  diarn S]  and  b  =  —  diam(S')2.  Then,  for  all  P  £  Sn, 

Hdistor-area  ,a,b  (P)  <Hdistor(H)  </32Hdistor  -area,  a,  6  (P)  <  0,  (2.3.7) 

where  (3=^ ^  £  [0, 1], 

In  fact,  similar  constant-factor  approximations  of  the  expected- value  mul¬ 
ticenter  function  Hexp  can  also  be  established,  see  [Cortes  et  al.,  2005]. 


2.3.2  Worst-case  and  disk-covering  multicenter  functions 

Given  a  compact  set  S  C  irf  and  a  performance  function  /,  let  us  consider 
the  point  in  S  that  is  worst  covered  by  a  set  of  points  pi,  ■  ■  ■  ,pn ■  Formally, 
we  define  the  worst-case  multicenter  function  Hworst  :  Sn  — »  M.  by 

HWOrst(pi,  •  •  •  ,Pn)  =  min  max  f(\\q  -  pi\\2).  (2.3.8) 

geS  ie{i,...,n} 

The  definition  of  HWOrst  can  be  read  as  follows:  for  each  location  q  £  S, 
consider  the  best  coverage  of  q  among  those  provided  by  each  of  the  nodes 
pi, . . .  ,pn ,  which  corresponds  to  the  value  maxjgj!  nj.  f(\\q  —  Then, 

compute  the  worst  coverage  Hworst  (pi,  •  •  •  ,Pn)  by  comparing  the  perfor¬ 
mance  at  all  locations  in  S. 

Given  the  interpretation  of  Hworst ,  we  seek  to  solve  the  following  geometric 
optimization  problem: 

maximize  Hworst  (pi,  ■  ■  •  ,Pn),  (2.3.9) 

that  is,  we  seek  to  determine  configurations  pi, . . .  ,pn  that  maximize  the 
value  of  HWorst-  An  equivalent  formulation  of  this  problem  is  referred  to 
as  a  continuous  p-center  problem  in  the  literature  on  facility  location,  see 
e.g.,  [Drezner,  1995]. 

In  the  present  context,  also  relevant  is  the  disk- covering  multicenter  func¬ 
tion  Hdc  :  Sn  — >  M,  defined  by 

Hdc(pi  ,■■■  ,Pn)  =  max  min  1 1  g  —  Pi  1 1 2  •  (2.3.10) 

q&S  ie{ l,...,n} 

The  value  of  Hdc  can  be  interpreted  as  the  largest  possible  distance  from 
a  point  in  S  to  one  of  the  locations  pi,...,pn.  Note  that,  by  definition, 
the  environment  S  is  contained  in  the  union  of  n  closed  balls  centered  at 
pi, . . .  ,pn  with  radius  Hdc(pi>  •  •  •  ,Pn )•  The  definition  of  Hdc  is  illustrated 
in  Figure  2.13(a). 

The  following  result  establishes  the  relationship  between  the  worst-case 
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and  the  disk-covering  multicenter  functions,  and  as  byproduct,  provides  an 
elegant  reformulation  of  the  geometric  optimization  problem  (2.3.9).  Its 
proof  is  left  to  the  reader. 

Lemma  2.19  (Relationship  between  HWorst  and  Tide)-  Given 
compact  and  a  performance  function  f  :  M>o  — ►  M,  one  has  7iwo rst  =  /°Hdc- 

Using  Lemma  2.19  and  the  fact  that  /  is  non-increasing,  we  can  reformu¬ 
late  the  geometric  optimization  problem  (2.3.9)  as 

minimize  Hdcipi,  ■  •  •  ,Pn),  (2.3.11) 

that  is,  find  the  minimum  radius  r  such  that  the  environment  S  is  covered  by 
n  closed  balls  center  at  pi, . . .  ,pn  with  equal  radius  r.  Note  the  connection 
between  this  formulation  and  the  classical  disk-covering  problem:  how  to 
cover  a  region  with  (possibly  overlapping)  disks  of  minimum  radius.  We 
shall  comment  more  on  this  connection  later. 


Given  the  equivalence  between  the  geometric  optimization  problems  (2.3.9) 
and  (2.3.11),  we  focus  our  attention  on  Tide-  The  disk-covering  multicenter 
function  can  be  alternatively  described  in  terms  of  the  Voronoi  partition  of 
S  generated  by  V  =  {pi, . . .  ,pn}.  For  (pi,...,pn)  G  Sn  \  Scoinc ,  one  has 

TidciPi,- ■  ■  ,Pn)  =  max  max  \\q  -  Pih 

ie{ l,...,n}  q&Vi{V) 

=  max  max  \\a  —  odU.  (2.3.12) 

i€{l,...,n}  q£dVi(P) 

This  description  of  Tide  is  illustrated  in  Figure  2.13(b).  The  expression  (2.3.12) 


Figure  2.13  Illustration  of  the  definition  of  Tide-  (a)  and  (b)  show  the  same  configuration, 
with  and  without  the  Voronoi  configuration,  respectively.  For  each  node, 
the  disk  is  the  minimum-radius  disk  centered  at  the  node  and  enclosing  the 
Voronoi  cell.  The  value  of  Tide  is  the  radius  of  the  disk  centered  at  the  leftmost 
node. 
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is  appealing  because  it  clearly  shows  the  value  of  hide  as  the  result  of  the 
aggregate  contribution  of  all  individual  nodes.  If  (p\, . . .  ,pn )  £  tSCOmc>  then 
a  similar  decomposition  of  hide  can  be  written  in  terms  of  the  distinct  points 
V  =  iw(pi,  ■  ■  ■ , Pn)-  A  node  i  £  {1, . . . ,  n}  is  called  active  at  (p\, . . .  ,pn )  if 
rna xqedVi(V)  \\q~Pi\\2  =  hidc(pi,  ■  ■  ■ ,Pn )•  A  node  is  passive  at  (pi,  ...,pn)  is 
it  is  not  active. 

Inspired  by  the  expression  (2.3.12),  let  us  define  a  more  general  version  of 
the  worst-case  multicenter  function.  Given  (p\. . . .  ,pn )  £  Sn  and  a  partition 
{Wi,...,Wn}  C  P(S)  of  S,  let 

hidc(pi,  •••,  Pn,  Wi, ...,  Wn)  =  max  max  ||g -p*||2- 

ie{l,...,n}  q£dWi 

Notice  that  Hdc(pi,  •  •  •  ,Pn )  =  77dc(pi,  •  ■  -,Pn,Vi(V), . .  .,Vn(V)),  for  all  (pi, . . .  ,pn )  £ 
Sn \ cScoinc •  Moreover,  one  can  establish  the  following  optimality  result  whose 
result  is  given  in  Section  2.5.4. 

Proposition  2.20  (77dc-°ptimality  of  the  Voronoi  partition  and  cir- 
cumcenter  locations).  For  any  V  =  {p\, . . .  ,pn}  £  F(S)  and  any  parti¬ 
tion  {Wi, ...  ,Wn}  CP(S)  ofS, 

Hdc(pi,  ■  ■  ■  ,Pn,  Vl(V),  .  .  .,Vn(V))  <  Hdc(pi,  ■  ■  •  ,Pn,  Wl,  .  .  .  ,  Wn), 

that  is,  the  Voronoi  partition  V(V)  is  optimal  for  hide  among  all  partitions 
of  S,  and 

ndc(cc(Wi), . . . , cc(wn), Wi,...,wn)<  Hdc(pi, ...,pn,Wi,...,wn), 

that  is,  the  circumcenter  locations  CC(Wi), . . . ,  CC (Wn)  are  optimal  for  Tide 
among  all  configurations  in  S. 

As  a  corollary  of  this  result,  we  have  that  the  circumcenter  of  S'  is  a  global 
optimum  of  hide  f°r  the  1-center  problem,  i.e.,  when  n  =  1.  This  comes  at 
no  surprise  since,  in  this  case,  the  value  7idc(p)  corresponds  to  the  radius  of 
the  minimum-radius  sphere  centered  at  p  that  encloses  S. 

The  following  result  characterizes  the  smoothness  properties  of  the  disk¬ 
covering  multicenter  function,  see  [Cortes  and  Bullo,  2005]  for  more  details 
and  for  the  proof. 

Theorem  2.21  (Smoothness  properties  of  hide)-  Given  S  C  Wl  com¬ 
pact,  the  disk-covering  multicenter  function  Tide  '■  Sn  —>  R  is  globally  Lips- 
chitz  on  Sn. 

The  generalized  gradient  and  the  critical  points  of  hide  can  be  charac¬ 
terized,  but  require  a  careful  study  based  on  nonsmooth  analysis  [Clarke, 

1983].  In  particular,  two  facts  taken  from  [Cortes  and  Bullo,  2005]  are  of  in- 
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terest  here.  First,  under  certain  technical  conditions,  one  can  show  that  the 
critical  points  of  Tide  are  circumcenter  Voronoi  configurations.  This  is  why 
we  refer  to  Tide  as  a  multicenter  function.  Second,  the  generalized  gradient 
of  Hdc  is  not  spatially  distributed  over  C/d-  This  is  essentially  due  to  the 
inherent  comparison  among  all  agents  that  is  embedded  in  the  definition  of 
Tide  (via  the  max  function) . 


2.3.3  Sphere-packing  multicenter  functions 

Given  a  compact,  connected  set  S  C  Md,  consider  the  situation  where  one 
seeks  to  cover  S  as  much  as  possible  by  means  of  open  balls  centered  at  a  set 
of  points  pi, ...  ,pn  with  equal  radius  in  such  a  way  that  these  regions  do  not 
intersect  or  leave  the  environment.  Formally,  we  define  the  sphere-packing 
multicenter  function  7isp  :  Sn  — >  M  by 

Hsp(pi,...,pn)  =  min  {  \ \\Pi  -  Pj  ||2,  dist(pj,  <95)|.  (2.3.13) 

The  definition  of  7isp  can  be  read  as  follows:  consider  the  pairwise  distances 
between  any  two  points  pi,  pj  (multiplied  by  a  factor  1/2  so  that  each  point 
can  fit  a  ball  of  equal  radius  and  these  balls  do  not  intersect),  and  the  indi¬ 
vidual  distances  from  each  point  to  the  boundary  of  the  environment.  The 
value  of  Ttsp  is  then  the  smallest  of  all  distances,  guaranteeing  that  the  union 
of  n  open  balls  centered  at  p±, . . .  ,pn  with  radius  Ttsp(p±, . . .  ,pn )  is  disjoint 
and  contained  in  S.  The  definition  of  7isp  is  illustrated  in  Figure  2.14(a). 


Figure  2.14  Illustration  of  the  definition  of  7 lsp.  (a)  and  (b)  show  the  same  configuration, 
with  and  without  the  Voronoi  configuration,  respectively.  For  each  node,  the 
disk  is  the  maximum-radius  disk  centered  at  the  node  and  contained  in  the 
Voronoi  cell.  The  value  of  7fsp  is  the  radius  shared  by  the  two  smallest  disks. 
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Given  the  interpretation  of  Hsp,  we  seek  to  solve  the  following  geometric 
optimization  problem: 

maximize  HSp(pi, . .  .,pn),  (2.3.14) 

that  is,  we  seek  to  determine  configurations  p\, . . .  ,pn  that  maximize  the 
value  of  7isp.  Note  the  connection  of  this  formulation  with  the  classical 
sphere-packing  problem:  how  to  maximize  the  coverage  of  a  region  with 
non-overlapping  disks  (contained  in  the  region)  of  maximum  radius.  We 
shall  comment  more  on  this  connection  later. 

The  sphere-packing  multicenter  function  can  be  alternatively  described 
in  terms  of  the  Voronoi  partition  of  S  generated  by  V  =  {p\ , . . .  ,pn}.  For 
(pi, . . .  ,pn)  £  Sn  \  cScoinc,  one  has 

7isp{pi, . . .  ,pn)  =  min  min  \\q  -  Pi\\2.  (2.3.15) 

ie{ l,...,n}  q£dVi(P) 

This  description  is  illustrated  in  Figure  2.14(b).  As  for  the  previous  multi¬ 
center  functions,  the  expression  (2.3.15)  is  appealing  because  it  clearly  shows 
the  value  of  of  7isp  as  the  result  of  the  aggregate  contribution  of  all  individ¬ 
ual  nodes.  If  (pi, . . .  ,pn)  £  tSCOmc>  then  a  similar  decomposition  of  Hsp  exists 
in  terms  of  the  distinct  points  V  =  *f(pi,  ■  ■  •  ,pn)-  A  node  i  £  {1, . . .  ,n} 
is  called  active  at  (pi, . . .  ,pn)  if  eamqGdVi^  I \q  ~  Pih  =  FGP(pi,  •  •  ■  ,Pn)-  A 
node  is  passive  at  (pi, . . .  ,pn )  is  it  is  not  active. 

Inspired  by  the  expression  (2.3.15),  let  us  define  a  more  general  version  of 
the  worst-case  multicenter  function.  Given  (pi, . . .  ,  pn)  £  Sn  and  a  partition 
{Wi, . . . ,  C  P(S)  of  S ,  let 

7iSp(pi, . . .  ,Pn,  Wi, . . . ,  Wn)  =  min  min  ||g  -  pi\\2. 

ie{l,...,n}  q&dWi 

Notice  that  7isp(pi,  ...,pn)  =  7isp(pi, . . .  ,pn,  V Vn(V)),  for  all  (pi, . . .  ,pn)  G 
5" \ cScoinc  Also  note  that  the  value  Hsp(qi, . . .  ,qn,  Wi, . . . ,  Wn )  is  the  same 
for  any  %  £  IC(lFj),  i  £  {1, . . . ,  n}.  With  a  slight  abuse  of  notation,  we  refer 
to  this  common  value  as  77sp(IC(l/Fi), . . . ,  IC (Wn),  W\, . . . ,  Wn).  Moreover, 
one  can  establish  the  following  optimality  result  (see  Section  2.5.5  for  the 
proof). 

Proposition  2.22  (Hsp-optimality  of  the  Voronoi  partition  and  in¬ 
center  locations).  For  any  V  =  {pi,...,pn}  £  F(S')  and  any  partition 
{Wi, . . . ,  Wn}  C  P(5)  ofS, 

7tsp(pi;  •  ■  •  )Pn,  V± (P),  ■  ■  ■ ,  Vn(V)')  >  Ti.Sp(pi,  ■  ■  ■ , Pm  FT  i , . . . ,  TFn), 

that  is,  the  Voronoi  partition  V(V)  is  optimal  for  HSp  among  all  partitions 
of  S,  and 

w8p(ic(wi), . . . ,  ic(wn),  wu . . . ,  Wn)  >  nSp(p  1, . . .  ,Pn,  Wi, . . . ,  wn), 
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that  is,  the  incenter  locations  IC(IFi), . . . ,  IC(W„)  are  optimal  for  7isp  among 
all  configurations  in  S. 

As  a  corollary  of  this  result,  we  have  that  the  incenter  set  of  S  is  composed 
of  global  optima  of  Ttsp  for  the  1-center  problem,  i.e. ,  when  n  =  1.  This 
comes  at  no  surprise  since,  in  this  case,  the  value  7isp(p)  corresponds  to  the 
radius  of  the  maximum-radius  sphere  centered  at  p  enclosed  in  S. 

The  following  result  characterizes  the  smoothness  properties  of  the  sphere¬ 
packing  multicenter  function,  see  [Cortes  and  Bulk),  2005]. 

Theorem  2.23  (Smoothness  properties  of  Tisp).  Given  S  C  com¬ 
pact,  the  sphere-packing  multicenter  function  Ttsp  :  Sn  — *  M  is  globally  Lip- 
schitz  on  Sn . 

We  conclude  the  section  with  some  remark  that  are  analogous  to  the  ones 
for  the  function  Tide-  The  generalized  gradient  and  the  critical  points  of  TLsp 
can  be  characterized,  but  require  a  careful  study  based  on  nonsmooth  anal¬ 
ysis  [Clarke,  1983].  In  particular,  two  facts  taken  from  [Cortes  and  Bullo, 
2005]  are  of  interest  here.  First,  under  certain  technical  conditions,  one 
can  show  that  the  critical  points  of  7isp  are  incenter  Voronoi  configurations. 
This  is  why  we  refer  to  TLsp  as  a  multicenter  function.  Second,  the  general¬ 
ized  gradient  of  7isp  is  not  spatially  distributed  over  Q q.  This  is  essentially 
due  to  the  inherent  comparison  among  all  agents  that  is  embedded  in  the 
definition  of  TLsp  (via  the  min  function) . 

2.4  NOTES 

A  thorough  introduction  to  computational  geometric  concepts  can  be  found 
in  [Preparata  and  Shamos,  1993,  de  Berg  et  ah,  2000,  O’Rourke,  2000].  The 
handbooks  [Goodman  and  O’Rourke,  2004,  Sack  and  Urrutia,  2000]  present 
a  comprehensive  overview  of  computational  geometric  problems  and  their 
applications.  Among  the  numerous  topics  we  do  not  discuss  in  this  chapter, 
we  mention  distance  geometry  and  rigidity  theory  [Whiteley,  1997]  that  are 
notable  for  their  applications  to  network  localization  and  formation  control. 

The  notion  of  Voronoi  partition,  and  generalizations  of  it,  have  been  ap¬ 
plied  in  a  numerous  areas,  including  spatial  interpolation,  pattern  analysis, 
spatial  processes  modeling,  and  optimization,  to  name  a  few.  The  sur¬ 
vey  [Aurenhammer,  1991]  and  the  book  [Okabe  et  ah,  2000]  discuss  the  his¬ 
tory,  properties,  and  applications  of  Voronoi  partitions.  The  nearest  neigh¬ 
bor  and  natural  neighbor  interpolations  based  on  Voronoi  partitions,  e.g., 
see  [Sibson,  1981,  Boissonnat  and  Cazals,  2002],  are  of  particular  interest 
to  the  treatment  of  this  chapter  because  of  their  spatially-distributed  corn- 
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putation  character.  Spatially-distributed  maps  for  motion  coordination  are 
discussed  in  [Martinez  et  al.,  2007c]  and  adopted  in  later  chapters. 

Proximity  graphs  [Jaromczyk  and  Toussaint,  1992]  are  a  powerful  tool  to 
capture  the  structure  and  shape  of  geometric  objects,  and  therefore,  have 
applications  in  multiple  areas,  including  topology  control  of  wireless  net¬ 
works  [Santi,  2005],  computer  graphics  [Langetepe  and  Zachmann,  2006], 
and  geographic  analysis  [Radke,  1988].  The  connectivity  properties  of  cer¬ 
tain  proximity  graphs  (including  the  ones  stated  in  Theorem  2.9)  are  taken 
from  [Cortes  et  ah,  2005,  2006].  In  cooperative  control,  a  closely  related 
notion  is  that  of  state-dependent  graph  [Mesbahi,  2005].  Random  geometric 
graphs  [Penrose,  2003]  and  percolation  theory  [Bollobas  and  Riordan,  2006, 
Meester  and  Roy,  2008]  study  the  properties  of  proximity  graphs  associated 
to  the  random  deployment  of  points  according  to  some  specified  density 
function. 

Locational  optimization  problems  [Drezner,  1995,  Drezner  and  Hamacher, 
2001]  are  spatial  resource-allocation  problems  (e.g.,  where  to  place  mail¬ 
boxes  in  a  city,  where  to  place  cache  serves  on  the  internet)  that  pervade 
a  broad  spectrum  of  scientific  disciplines.  Computational  geometry  plays 
an  important  role  in  locational  optimization  [Robert  and  Toussaint,  1990, 
Okabe  et  ah,  2000].  The  field  of  geometric  optimization  [Mitchell,  1997, 
Agarwal  and  Sharir,  1998,  Boltyanski  et  ah,  1999]  blends  the  geometric  and 
locational  optimization  aspects  to  study  a  wide  variety  of  optimization  prob¬ 
lems  induced  by  geometric  objects.  The  smoothness  properties  of  the  cost 
function  7iex p  are  taken  from  [Cortes  et  ah,  2005]. 

2.5  PROOFS 

This  section  gathers  the  proofs  of  the  main  results  presented  in  the  chapter. 


2.5.1  Proofs  of  Theorem  2.7  and  Theorem  2.9 

Proof  of  Theorem  2. 7.  The  inclusions  in  fact  (i)  are  taken  from  [Jaromczyk  and  Toussaint, 

1992,  de  Berg  et  ah,  2000].  The  proof  of  the  first  inclusion  in  fact  (ii)  is  as 

follows.  Let  ( Pi,Pj )  E  £gGn6disk(r)(n  From  the  definition  of  the  Gabriel 

graph,  we  deduce  that  ||p*+Pj  —  T'i  1 1 2  =  HP*^Pj  —  Pj\\ 2  <  |[  Pl~^P3  —  ||2,  for 

all  k  E  {l,...,n}  \  {i,j},  and  therefore,  p'12Pj  E  Vj('P)  fl  Vj(V).  Since 

(■ Pi,Pj )  E  £gdisk(r) (T7),  we  deduce  that  p'f2Pj  £  B(pi,  |)  n  |),  and  hence 

{PiiPj)  £  £gi,i>(r){'P)-  The  second  inclusion  in  (ii)  is  straightforward:  if 

(Pi,Pj)  e  £gLO(r)(P),  then  ViifP)  Cl  Vj(V)  /  0,  i.e.,  ( Pi,Pj )  E  SgD(V).  Since 

clearly  (pi,pj)  E  £gdisk(r)(P),  we  conclude  (ii).  ■ 
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Proof  of  Theorem  2.9.  The  proof  of  fact  (i)  is  as  follows.  Let  V  £  F(Md). 
If  ^emst(T’)  C  C/diskMC^),  then  clearly  Qdisk{r){V)  is  connected.  To  prove 
the  other  implication,  we  reason  by  contradiction.  Assume  t/disk(?’)(T’)  is 
connected  and  let  t?EMST(T’)  $2  Qd\sk{f){B),  i.e. ,  there  exists  pt  and  pj 
with  ( pi,pj )  £  £gEMST (fP~)  and  \\pi  —  pj || 2  >  r.  If  we  remove  this  edge 
from  £gEMST(V),  then  the  tree  becomes  disconnected  into  two  connected 
components  T±  and  T2,  with  pi  £  T\  and  pj  £  T2.  Now,  since  by  hy¬ 
pothesis  t?disk(?" ) (fP )  is  connected,  there  must  exist  k,l  £  {l,...,n}  such 
that  pk  £  T\ ,  pi  £  T2  and  |j pk  -  P1W2  <  r.  If  we  add  the  edge  (pk,Pi) 
to  the  set  of  edges  of  T\  L)  T2,  then  the  resulting  graph  G  is  acyclic,  con¬ 
nected,  and  contains  all  the  vertices  V ,  i.e.,  G  is  a  spanning  tree.  Moreover, 
since  || pj.  —  pi\\2  <  r  <  \\pi  —  Pj\\2  and  T\  and  T2  are  induced  subgraphs  of 
^emst(T’),  we  conclude  that  G  has  smaller  length  than  ^emst(T’),  which 
is  a  contradiction  with  the  definition  of  the  Euclidean  minimum  spanning 
tree. 

Let  us  prove  fact  (ii).  For  r  £  R+,  it  is  enough  to  show  that  <7emst  H  ^disk(^) 
has  the  same  connected  components  as  f?disk(0>  since  this  implies  that 
the  same  result  holds  for  <7rn  H  ^disk(^),  H  C/diskM,  ancl  £ld(0-  Since 
Hemst  H  f/diskM  is  a  subgraph  of  t/diskM)  if  is  clear  that  vertices  belong¬ 
ing  to  the  same  connected  component  of  <5emst  H  <5disk(0  must  also  belong 
to  the  same  connected  component  of  t/disk(0-  To  prove  the  converse,  let 
V  £  and  assume  pi  and  pj  in  V  verify  || pi  —  pj\\2  <  r.  Let  C  be  the 

connected  component  of  fJdiskMO3)  to  which  they  belong.  With  a  slight 
abuse  of  notation,  we  also  denote  by  C  the  vertices  of  the  connected  com¬ 
ponent.  Since  C  is  connected,  then  ^emst(C)  C  C  by  fact  (i).  Moreover, 
since  all  the  nodes  in  V  \  C  are  at  a  distance  strictly  larger  than  r  from 
any  node  of  C,  we  deduce  from  the  definition  of  the  Euclidean  minimum 
spanning  tree  that  <7emst(C)  is  equal  to  the  subgraph  of  ^emst(T’)  induced 
by  C.  Therefore,  f/EMS'dC')  C  £emst  H  Gdisk(r)(V),  and  pi  and  pj  belong  to 
the  same  component  of  <7emst  H  f/disk(r)(T’).  This  implies  the  result.  ■ 


2.5.2  Proof  of  Proposition  2.10 

Proof.  Regarding  the  statement  on  I/rn  H  £/disk(0>  note  that 

B{pi,  \\pi  -pj\\2)nB(pj,  Wpi-pjh)  C  B(pi,  \\pi-pj H2). 

Therefore,  if  \\pi  —  Pj\\2  <  r,  then  any  node  contained  in  the  intersection 
B{pi,  || Pi  —  pj\\2)  C\  B(pj,  || pi  —  pj  H2)  must  necessarily  be  within  a  distance 
r  of  pi .  From  here,  we  deduce  that  C/rn  n  f?disk(r)  is  spatially  distributed 
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over  t^disk  (V) .  Regarding  the  statement  on  Qq  n  ^disk(^),  note  that 


B 


Pi+Pj  \\Pi~Pjh 


C  B(ph  \\pi-pjh)- 


Therefore,  if  \\pi  —  pj\\r2  <  r,  then  any  node  contained  in 
must  necessarily  be  within  a  distance  r  of  pi.  From  here,  we  deduce  that 
Qg  O  ^disk(^)  is  spatially  distributed  over  {JdiskM-  Finally,  note  that  if  \\pi  — 
Pj  || 2  >  r,  then  the  half  plane  {q  E  R2  |  || q  —  pi\\2  <  ||g  —  Pj  || 2}  contains  the 
ball  B  (pi ,  | ) .  Accordingly, 

vir(v)  =  vl(v)n  s(pi,§) 

=  {g  6  R2  |  ||<?  —  Pi 1 1 2  <  Ik  -Pjlb,  for  all  Py  £P}n  B(pi,  2) 

=  {?  e  R2  I  Ik-Pilb  <  Ik-Pjlb,  for  all  pj  eA/gdisk(r)iPi(F)}nB(pi,  §), 

from  where  we  deduce  that  £/ld(0  is  spatially  distributed  over  t/diskM-  H 


2.5.3  Proof  of  Theorem  2.17 

We  begin  with  some  preliminary  notions.  In  the  following,  a  set  12  C  R2 
is  piecewise  continuously  differentiable  if  its  boundary,  <912,  is  a  not  self- 
intersecting  closed  curve  that  admits  a  continuous  and  piecewise  continu¬ 
ously  differentiable  parameterization  7  :  [0, 1]  — >  R2.  Likewise,  a  collection 
of  sets  {12(x)  C  R2  |  x  E  (a,  b)}  is  a  piecewise  continuously  differentiable 
family  if  Q(x)  is  piecewise  continuously  differentiable  for  all  x  €  ( a,b ),  and 
there  exists  a  continuous  function  7  :  [0,1]  x  ( a,b )  — >  M2,  (t,x)  1— >  7 (t,x), 
continuously  differentiable  with  respect  to  its  second  argument,  such  that 
for  each  x  £  (a,  b ),  the  map  1 1— >  7x(f)  =  7(2,  x)  is  a  continuous  and  piecewise 
continuously  differentiable  parameterization  of  <9 12(x).  We  refer  to  7  as  a 
parameterization  for  the  family  {f2(x)  C  R2  |  x  £  (a,  b)}. 

The  following  result  is  an  extension  of  the  integral  form  of  the  Conservation- 
of-Mass  Law  in  fluid  mechanics  [Chorin  and  Marsden,  1994]  and  of  the  clas¬ 
sic  divergence  theorem  in  differential  geometry  [Chavel,  1984]. 

Proposition  2.24  (Generalized  conservation  of  mass).  Let  {O(m)  C 
M2  |  x  £  (a,  6)}  be  a  piecewise  continuously  differentiable  family  such  that 
Ll(x)  is  star-shaped  for  all  x  £  (a,  6).  Let  the  function  <p  :  M2  x  (a,  b)  — >  R 
be  continuous  on  M2  x  ( a,b ),  continuously  differentiable  with  respect  to  its 
second  argument  for  all  x  £  (a,  b)  and  almost  all  q  £  f2(x),  and  such  that 
for  each  x  £  (a,  b),  the  maps  q  1— >  4>(q,x)  and  q  1— >  x)  are  measurable, 
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and  integrable  on  £l(x).  Then,  the  function 

(a,b)  3  x  i— >  /  (f>(q,x)dq  (2.5.1) 

J  Q,(x) 

is  continuously  differentiable  and 

~r~  [  4>(q,x)dq=  I  jf~(q,x)dq+  [  ,  x)  (n(j)  ■  dj  , 

dx  Jn(x)  Jq(x)  ox  Jda{x)  \  dxJ 

where  n  :  dfl(x)  —>  R2,  q  i— >  n(q),  denotes  the  unit  outward  normal  to  dQ(x) 
at  q  E  dVL{x),  and  7  :  [0, 1]  x  (a,  b )  — »  R2  is  a  parameterization  for  the  family 
{fl(x)  C  R2  |  x  E  (a,  b)}. 

We  interpret  the  proposition  as  follows:  in  the  fluid  mechanics  interpreta¬ 
tion,  as  the  parameter  x  changes,  the  total  mass  variation  inside  the  region 
can  be  decomposed  into  two  terms.  The  first  term  is  the  amount  of  mass 
created  inside  the  region,  whereas  the  second  term  is  the  amount  of  mass 
that  crosses  the  moving  boundary  of  the  region. 


Proof  of  Proposition  2.2f.  Let  xq  E  (a,b).  Using  the  fact  that  the  map 
7  is  continuous  and  that  U(xo)  is  star-shaped,  one  can  show  that  there 
exist  an  interval  around  xq  of  the  form  (xq  —  £,xq  +  e),  a  continuously 
differentiable  function  uXo  :  [0, 1]  x  R>o  — >  R2  and  a  function  rXo  :  [0, 1]  x 
(xo  —  £,  xo  +  e)  — >  R>o  continuously  differentiable  in  its  second  argument 
and  piecewise  continuously  differentiable  in  its  first  argument,  such  that  for 
all  x  E  (xo  —  £,  xo  +  e),  one  has 


fl(x)  =  Ute[0}1]{uXo(t,s)  |  0  <  s  <  rXo(t,x)}, 
-j(t,x)  =  uXo(t,rXo(t,x)),  for  all  t  E  [0, 1]. 


For  simplicity,  we  denote  by  r  and  u  the  functions  rXo  and  uXo ,  respectively. 
By  definition,  the  function  in  (2.5.1)  is  continuously  differentiable  at  xo  if 
the  following  limit  exists: 


lim  — 

h^o  h 


Q(xo+h) 


f{q,xo  +  h)dq 
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and  depends  continuously  on  xq.  Now,  we  can  rewrite  the  previous  limit  as 


lim  —  [ 
h— >0  h  J o 


r(t,xQ+h) 


1 


=  lim 
/i— >o  h  J  o 

rr(t,x0 ) 


f 

JO 


l 


s),  x0  +  h ) 

r(t,x  0) 

0(u(t,s),xo) 


du  du 
dt  ds 
du  du 
dt  ds 


ds 


ds  )dt 


r(t,x0+h) 


r(t,x  0) 


S ),  Xq  +  h) 


du  du 
dt  ds 


ds 


+ 


rr{t,x0)  qu  qu 

J  ((j)(u(t,  s),x0  +  h)  -  c/)(u(t,  s),x0))  x  —  2 dsjdt ,  (2.5.2) 


where  x  denotes  the  vector  product  and  for  brevity  we  omit  that  the  partial 
derivatives  and  ^  are  evaluated  at  (f,  s )  in  the  integrals.  Regarding  the 
second  integral  in  the  last  equality  of  (2.5.2),  since 


lim  — 
fr->o  h 


u(t,  s),x0  +  h)~  <t>{u{t,  s),xQ)) 


du  du 
dt  ds 


d(f>  ,  ,  .  . 

=  a—{u(t,s),x0) 
dx  o 


du  du 
dt  ds 


almost  everywhere  and  this  function  is  measurable  and  its  integral  over 
the  bounded  set  D(x o)  is  finite  by  hypothesis,  the  Lebesgue  Dominated 
Convergence  Theorem  [Bartle,  1995]  implies  that 


1  pi  rr(t,x o) 

^hJoJo 


(Hu(t,  s),XQ  +  h)  -  <j>(u{t,  s),  x0)) 


du  du 
dt  ds 


dsdt 


1  pr(t, Xo)  ax 

^{u(t,s),x0) 


0  J  0 


/ 

j  fl(x o) 


du  du 
dt  ds 


dsdt 


q,x0)dq . 


(2.5.3) 


On  the  other  hand,  regarding  the  first  integral  in  the  last  equality  of  (2.5.2), 
using  the  continuity  of  0,  one  can  deduce  that 


1  /•!  rr(t,x0+h) 


(j>(u(t,s),xo  +  h)  ^(t,s)x^(t,s) 

( ty 


h-^0  h 


-,x0) 


du 

w 


du 
ds  1 


ds  dt 


2  pi  pxo+h 

lim  -  /  /  (f>(u(t,r(t,  z)),xq  +  h) 

h^°  h  Jo  Jx0 


r)n  mi  ht 

—  (t,  r(t,  z))  x  —(t,  r(t,  z))  z )  dz  dt 

f  ^  du  du  dv 

=  j  cj>(u(t,r(t,x o)),x0)  —(t,r(t,x0))x—(t,r(t,x0))  ^-^-(t,x0)  dt. 
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Since  7 (t,  x )  =  u(t,  r(t,  x))  for  all  t  G  [0, 1]  and  x  G  {xq  —  e,xg  +  e),  one  has 


d'y 

dt 


{t,x  0) 


d'y 

dx 


(t,x  0) 


HI  /  H  7  /  //r* 

—  (t,r(t,x0))  +  -^(t,r(t,xo))  —  {t,x0) 

nil  nv 

—  (t,r(t,xo))  —  (t,xQ). 


Let  x  denote  the  angle  formed  by  ^(t,x 0)  and  jff(t,r(t,x 0)).  Then  (omit¬ 
ting  the  expression  (t,r(t,x))  for  brevity), 


du  du 
dt  ds  2 


du  du  dr 
dt  ds  dt 


du 

ds 


d'y 


dt 


du 

d'y 

2 

ds 

sin  y  = 

2 

dt 

nT(y) 


du 
ds  ’ 


where  in  the  last  inequality  we  have  used  the  fact  that,  since  'yXo  is  a  param¬ 
eterization  of  dQ(xo),  then  siny  =  cosip,  where  if  is  the  angle  formed  by  n, 
the  outward  normal  to  5f2(a;o),  and  §7-  Therefore,  we  finally  arrive  at 


r 1  r)ii  (I'll  ()r 

J  4>{'y{t),x0)  —(t,r(t,x0))x—(t,r(t,x 0))  ^(t,x0)dt 

d'y  d'y 

=  Jo  ~dt^t,x°'>  2nT(^(*>x°))^(f’xo)^ 


L 


^(7,x0)nT(7)^d7. 


9f2(a;o) 


(2.5.4) 


Given  the  hypothesis  of  Proposition  2.24,  both  terms  in  (2.5.3)  and  (2.5.4) 
have  a  continuous  dependence  on  xg  G  ( a,b ).  This  concludes  the  proof.  ■ 


We  are  finally  ready  to  state  the  proof  of  the  main  result  of  Section  2.3. 


Proof  of  Theorem  2. 1 7.  We  prove  the  theorem  statement  when  the  perfor¬ 
mance  function  is  continuously  differentiable  and  we  refer  to  [Cortes  et  al., 
2005]  for  the  complete  proof  for  the  case  when  the  performance  function 
is  piecewise  continuously  differentiable.  Specifically,  we  show  that  if  /  is 
continuously  differentiable,  then  for  P  G  Sn  \  Sco inc, 


dHex  p 

dpi 


/(Ik  -  Pih)f>{q)dq. 
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From  Proposition  2.24,  we  have 


f  — 

JVi(V) 


f(h  ~  Pih)(t>(q)dq 


<p(Pj,q){n(7j)  ■  ^ 


dj 


ji 


where  7 j  is  a  parametrization  of  Vj(V)  and  where  we  abbreviate  <p(j)j,q )  = 
/(Ik  —  Pj  IbD'/H?)-  Next,  we  show  that  the  second  term  vanishes.  Note  that 
the  motion  of  pi  affects  the  Voronoi  cell  Vi(V)  and  the  cells  of  all  its  neighbors 
in  NgDyPi(V).  Therefore,  the  second  term  equals 


<fi(Pi,q)(n(  7i)-p^)dji+  Y 


IdVi  (V) 


dpi 


v^Pi{v)J9VAV) 


dpi 


Without  loss  of  generality  assume  Vi(V)  does  not  share  any  face  with  dS. 
Since  the  boundary  of  Vi(V)  satisfies  dVi(V)  =  (Jy  A y,  where  A ^  =  A ji  is 
the  edge  between  Vi(V)  and  Vj(V),  for  all  neighbors  pj,  we  compute 


ldVi(P) 


v(pi,q)(n(ji)  •  =  Y  i  <p(pi’(i)(nij('yj)  '7r)d'yji 

Pi  P^eD,Pi(V)J^ 

Lm‘t’iPl'q)(nhl) '  wdli  = 


where  riij  denotes  the  unit  normal  along  Atj  outward  of  Vi(P).  Noting  that 
riji  =  —  riij  and  collecting  the  results  obtained  so  far,  we  write 


•=1  JdVAV) 


dpi 


E 


PjSA/gD,Pi (V) ' 


> q)  -  p{pj  ,q))[  nij  (7 j  )m*fyi)drt3- 


This  quantity  vanishes  because  f(\\q  —  P1W2)  =  /(Ik  —  Pjlk),  and  therefore, 
ip(pi,q)  =  <p(pj,q)  for  any  q  belonging  to  the  edge  A y.  ■ 


2.5.4  Proof  of  Proposition  2.20 

Proof.  Recall  that  Hdc(pi,  •  •  • , Pn )  =  Ti-dcipi,  ■  •  •  ,Pn,  Vi (V), . . . ,  Vn(V)).  To 
show  the  first  inequality,  let  j  E  { 1 , . . . ,  n}  and  q *  E  Vj(V)  be  such  that 
7ddc(Pu  ■■■  ,Pn)  =  \\q*  ~ Pjh-  By  definition,  given  a  partition  {Wi, . . . ,  Wnj 
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of  S,  there  exists  k  such  that  g*  £  Wk-  Therefore, 

WdcOl ,---,Pn)  =  Ik*  ~Pjh  <  Ik*  ~Pkh 

<  max  |k-Pj||2  <  'Hdcipi,  •  ■  ■  ,Pn,Wi, . . .  ,Wn). 

q&Wk 

To  show  the  second  inequality,  note  that  the  definition  of  circumcenter  im¬ 
plies  that,  for  each  i  £  {1, . . . ,  n}, 

max  ||g  -  CC(Wj)||2  <  max  |  k  —  Pi  1 1 2  - 

q&OWi  qGdWi 

Taking  the  maximum  over  all  nodes,  we  deduce 
7idc(CC(TTi), . . . ,  CC(Wn),  Wi, . . . ,  Wn)  <  Hdc(pi, . . .  ,pn,  Wi,  . . . ,  Wn), 
as  claimed.  ■ 


2.5.5  Proof  of  Proposition  2.22 


Proof.  Recall  that  Hsp(pi,  ...,pn)  =  7fsp(pi,  ■  ■ .  ,pn,  Vi(V), . . . ,  Vn(V)).  To 
show  the  first  inequality,  let  j  £  {l,...,n}  and  g*  0  int (Vj(V))  be  such 
that  Hsp(pi, .  ■  ■  ,pn)  =  |k*  ~  Pj\\-2-  Since  g*  0  int (Vj(V)),  there  exists  i  £ 
{1, . . .  ,n}  such  that  |k*  ~Pj |k  >  Ik*  ~Pi Ik-  On  the  other  hand,  there  must 
exist  k  £  {1 , ,n}  such  that  g*  £  Wk ■  Now,  if  k  =  j,  then  g*  0  int  (IT,;). 
Therefore, 

Wsp(pi, . . .  ,pn)  =  Ik*  ~Pjh  >  Ik*  ~Pi h 

>  min  Ik  -  Pi||2  >  Wsp(pi,  ...,Pn,Wl,...,  Wn). 

q^int(Wi) 

Now,  if  k  =  i,  then  g*  0  int(lVj).  Therefore, 

Hsp(P)  =  ||g*  -Pjh  >  min  \\q-Pi\\2>/Hsp{pi,...,pn,Wi,...,wn). 

qgint{Wj) 

Finally,  if  k  /  i,j,  then  g*  0  int(VT,)  U  int  (IV,),  and  a  similar  argument 
guarantees  7fsp(pi,  •  •  • ,pn)  >  T~CsP(pi,  ■  ■  ■  ,Pn,  Wi, . . . ,  Wn). 


To  show  the  second  inequality,  let  i  £  {1, . . .  ,n}  and  select  qi  £  IC(Wi). 
The  definition  of  incenter  set  implies  that, 


mm  g 

q&dWi 


Qi  lb  >  min  ||g 

q£dWi 


■  Pi  || 2’ 


The  expression  on  the  left  does  not  depend  on  the  specific  point  selected  in 
the  incenter  set.  Taking  the  minimum  over  all  nodes,  we  deduce 


Hsp(IC(TTi), . . . ,  IC(Wn),  ITi, . . . ,  Wn)  >  Wsp(pi,  ...,pn,W1,...,Wn), 
as  claimed.  ■ 
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2.6  EXERCISES 

E2.1  (Proof  of  Lemma  2.2).  For  S  =  {pi, . . .  ,pn}  £  F(Rd)  with  n  >  2,  prove  the 
following  statements: 

(i)  CC(S)  £  co (S)  \  Ve(co (5)); 

(ii)  if  p  £  co (S)  \  {CC(S)}  and  r  £  R>o  are  such  that  S  C  B(p,r),  then 
]p,  CC(S)[  has  a  nonempty  intersection  with  _B(2±2,  |)  for  a]j  q  £  co(S). 

Hint:  To  show  (i),  invoke  the  definition  of  circumcenter.  To  show  (ii),  distinguish 
between  the  case  when  \\p  —  q\\2  <  r  and  ||p  —  q || 2  =  r.  A  proof  is  contained 
in  [Cortes  et  at.,  2006]. 

E2.2  (The  inclusion  5ld(0  C  Gd  fl f/disk(r)  is  in  general  strict).  Consider  the 
nodes  pi  =  (0,0),  P2  =  (1,0),  and  ps  =  (2,  H).  Pick  r  =  3  and  perform  the 
following  tasks: 

(i)  draw  the  three  points,  their  Voronoi  polygons  and  the  disks  centered  at 
the  points  with  radius  r;  and 

(ii)  show  that  p\  and  ps  are  neighbors  in  the  graph  Gd  PI  t/disk(r),  but  not  in 
the  graph  Gld  (r ) . 

E2.3  (The  proximity  graph  Gd  PI  t/disk(f)  is  not  spatially  distributed  over 

f/disk(r)).  Consider  the  nodes  pi  =  (0,0),  P2  =  (1,0),  pz  =  (2,  ^),  and  74  = 
(0,  y|).  Compute  the  Voronoi  partitions  of  the  plane  generated  by  {pi,P2-P3} 
and  {pi,p2,p3,p4.}.  For  r  =  3,  show  that  pi  and  ps  are  neighbors  in  the  graph 
Gd  n<5disk(r)({pi,p2,P3})  but  not  in  the  graph  Gd  PI  <5disk(r)({pi,p2,P3,P4})-  Why 
does  this  exercise  illustrate  that  Gd  PI  5disk(r)  is  not  spatially  distributed  over 
Gdisk(r)? 

E2.4  (1-center  area  problem).  Let  W  C  R2  be  a  polygon,  (f>  a  density  function  on 
R2  and  a  £  R>o-  Assume  that  the  a-contraction  of  W  is  non-empty.  Consider  the 
area  function  Tti  :  W  — >  R  defined  by 

Hi  (p)  =  /  _  (j>{q)dq  =  A^W  CB{p,a)). 

J  W DB(p,a) 

Justify  informally  why,  at  points  in  the  boundary  of  a  convex  polygon  W,  the  gra¬ 
dient  of  Tti  is  non- vanishing,  and  points  toward  the  interior  of  the  polygon.  (Note 
that  it  is  not  known  whether  the  function  Tti  is  concave  and  how  to  characterize 
critical  points  of  Tti  in  geometric  terms.) 

E2.5  (Proof  of  Proposition  2.15).  This  exercise  asks  you  to  prove  a  statement 
slightly  more  general  than  Proposition  2.15.  Let  {Wi, . . . ,  Wn}  C  P(S')  be  a  parti¬ 
tion  of  S  C  Rd  and  4>  a  density  function  on  Rd .  For  any  {pi, . . .  ,pn},  {p1} . . .  ,pn}  £ 
F(5)  with  the  property  that,  for  all  i  £  {1, . . . ,  n}, 

II Pi  -  CM*(Wi)||a  <  ||pi  -  CM*(Wi)||„ 

show  that: 

Hdistor  (Pi ,  *  *  *  ,  Pn ,  Wl ,  .  .  .  ,  VFn)  ^  Hdistor  (pi ,  ■  ■  ■  ,  Pn ,  IFl ,  •  •  -  ,  UTi) , 

Moreover,  if  all  the  sets  {Wi, . . . ,  W„ }  have  non-vanishing  areas,  the  inequality  is 
strict  if  there  exists  i  £  {1, . . . ,  n}  such  that  Hf^  —  CM.^{Wi)\\2  <  \\pt  —  CM</,(Wi)||2 
Hint:  Use  the  expression  ofTtdistoi  in  (2.3.5). 
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E2.6  (Mixed  distortion-area  multicenter  function).  Show  that  the  expected  mul¬ 
ticenter  function  7 texp  takes  the  form  of  7fdistor-area,a  stated  in  Section  2.3.1  when 
the  performance  function  is 

f(x)  =  -X2  l[0,a](z)  +  b  ■  l]a,  +  oo[(*), 

with  a  £  R>o  and  b  <  —a2. 

Hint:  As  an  intermediate  step,  show  that  for  P  =  (pi,  ■  ■ .  ,Pn)  £  Sn ,  one  has 
Vi(P)n(S\B(Pi,a))  =  Vi{P)n(S\\JS=1B(pk,a))  for  all  i  £  {1, . . . ,  n}. 

E2.7  (Proof  of  Proposition  2.16).  This  exercise  is  a  guided  proof  of  Proposition  2.16. 
Let  W  C  Rd  be  a  connected  set,  <j>  a  density  function  on  Rd  and  a  £  R>o. 
For  p  £  W  and  B  a  closed  ball  centered  at  a  point  in  W  with  radius  a,  define 
(p,  B)  i  *  Hw(p,B)  by 

Hw(p,B)  =  -[  _\\q  ~  p\\l<t>(q)dq  -  [  _  a2  <j>(q)dq. 

JwnB  Jwn(S\B ) 

Do  the  following: 

(i)  Show  that  the  multicenter  function  7fdistor-area,a  admits  the  expression 

n 

^distor-areaja  (pi,...,pn,Wi,...,W„)  =  Y 'Hwi(pi,B(pi,a))\ 

i=  1 

(ii)  given  a  closed  ball  B  centered  at  a  point  in  W  with  radius  a,  show  that 
for  any  p  £  W, 

Tlw(CM^(W  fl  B),  B)  >  Hw(p,  B), 

with  strict  inequality  unless  p  =  PI  B); 

Hint:  Use  the  Parallel  Axis  Theorem  [Hibbeler,  2006]; 

(iii)  given  p  £  W,  show  that  for  any  closed  ball  B  centered  at  a  point  in  W 
with  radius  a , 

Hw(p,  B(p,  a))  >  Hw(p,  B); 

Hint:  Consider  the  decomposition  of  W  given  by  the  union  of  the  disjoint 
sets  B(p,a)C\B,  B(p,a)r\(W\B),  (W\B(p,a))nB  and  (VE\B(p,a))n 
(W  \  B),  and  compare  the  integrals  over  each  set; 

(iv)  deduce,  using  (ii)  and  (iii),  that 

Hw(CM$(W  n  B(p,  a)),  B(CM$(W  n  B(p,  a)),  a))  >  Hw(p,  B(p,  a)), 

with  strict  inequality  unless  p  =  fl  B)\  and 

(v)  combine  (i)  and  (iv)  to  prove  Proposition  2.16. 

E2.8  (Locally-cliqueless  proximity  graph).  Give  an  example  of  an  allowable  en¬ 
vironment  Q  and  a  configuration  of  points  such  that  the  inclusions  of  Theo¬ 
rem  2.12(i), 


Semst.s  C  Q\c,q  C  Q 


are  strict  for  Q  —  Qv,s,q- 
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E2.9  (Properties  of  the  locally-cliqueless  graph).  Prove  Theorem  2.12. 

Hint:  This  exercise  has  considerable  theoretical  content.  To  prove  Theorem  2.12(i), 
use  an  argument  by  contradiction  to  show  that  the  first  inclusion  holds,  and  the 
definition  of  locally-cliqueless  graph  to  show  that  the  second  inclusion  holds. 

E2.10  (When  are  the  total  derivative  and  the  partial  derivative  of  a  function 
equal?).  Assume  /  :  R  x  R  — >  R  is  continuously  differentiable  in  its  both  argu¬ 
ments  and  let  di  f  its  partial  derivative  with  respect  to  its  first  argument.  Assume 
the  function  y*  :  R  — >  R  satisfies,  for  each  x  £  R, 

f(x,y*(x))  =  max{/(i,  z)  \  z  £  R}, 
and  is  continuously  differentiable.  Show  that: 

^f(x,y*{x))  =  dif(x,y*(x)). 

Explain  how  this  result  gives  insight  into  the  expression  of  the  gradient  of  77exP 
in  Theorem  2.17(ii)  for  a  continuously  differentiable  performance  function. 

E2.ll  (Distortion  gradient  ascent  flow).  Given  a  (convex)  polytope  S  C  Rd  and  a 
density  function  (j>,  consider  n  nodes  pi, . . .  ,pn  evolving  under  the  continuous-time 
gradient  ascent  flow  of  the  multicenter  function  77distor, 

Pi=2A^Vi(V))(CM^Vi(V))~pi)1  ie  nj. 


(i)  What  are  the  equilibrium  points? 

(ii)  show  that  7fdistor  is  monotonically  non-decreasing  along  the  flow; 

(iii)  justify  that  the  polytope  S  is  invariant,  i.e.,  that  the  trajectories  of  the 
system  remain  in  S; 

(iv)  use  (i)-(iii)  to  apply  the  LaSalle  Invariance  Principle  and  show  that  the 
solutions  of  the  flow  converge  to  the  set  of  centroidal  Voronoi  configura¬ 
tions  in  S',  and 

(v)  implement  numerically  the  flow  in  the  software  of  your  choice.  Select  the 
unit  square  S  =  [0, 1]  x  [0, 1]  and  the  density  function 

<t>  =  exp  (  -  (x  -  ^)2  -  (y  -  ^)2)  +exp  (  —  (x  —  ^)2  -  (y  -  ^)2)  . 

Run  simulations  from  different  initial  conditions  and  with  different  num¬ 
bers  of  nodes.  Show  by  illustration  that  multiple  local  maxima  exist. 

E2.12  (Area  gradient  ascent  flow).  Given  a  (convex)  polytope  S  C  R  ,  a  density 
function  fi,  and  a  radius  a  £  R>o,  consider  n  nodes  pi, ...  ,pn  evolving  under  the 
continuous-time  gradient  ascent  flow  of  the  multicenter  function  77area,a, 

Pi=  _  nout, B(Pi,a)(l)<l>(q)dq,  i€{l,...,n}. 

JViCP)  n  dB(Pi,a) 


(i)  What  are  the  equilibrium  points? 

(ii)  show  that  Tta.iea,a  is  monotonically  non-decreasing  along  the  flow; 
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(iii)  justify  that  the  polytope  S  is  invariant,  i.e. ,  that  the  trajectories  of  the 
system  remain  in  S; 

(iv)  use  (i)-(iii)  to  apply  the  LaSalle  Invariance  Principle  and  show  that  the 
solutions  of  the  flow  converge  to  the  set  of  a-limited  area-centered  Voronoi 
configurations  in  S',  and 

(v)  implement  numerically  the  flow  in  the  software  of  your  choice.  Select  the 
unit  square  S  =  [0, 1]  x  [0, 1],  the  density  function 

cf>{x,y)  =  exp  (  -  (x-  ^)2  -  (y  -  ^)2)  +  exp  (  -  (x  -  ^)2  -  (y  -  ^)2), 

and  the  parameter  a  =  Run  simulations  from  different  initial  con¬ 
ditions  and  with  different  numbers  of  nodes.  Show  by  illustration  that 
multiple  local  maxima  exist. 
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Chapter  Three 

Robotic  network  models  and  complexity  notions 


This  chapter  introduces  the  main  subject  of  study  of  this  book,  namely  a 
model  for  groups  of  robots  that  can  sense  their  own  position,  communi¬ 
cate  messages  according  to  a  geometric  communication  topology,  process 
information,  and  control  their  motion.  We  refer  to  such  systems  as  robotic 
networks.  The  exposition  of  this  chapter  has  evolved  from  [Martinez  et  ah, 
2007a], 

The  chapter  is  organized  as  follows.  The  first  section  of  the  chapter  con¬ 
tains  the  formal  model.  We  begin  by  presenting  the  physical  components  of  a 
network,  that  is,  the  mobile  robots  and  the  communication  service  connect¬ 
ing  them.  We  then  present  the  notion  of  control  and  communication  law, 
and  how  a  law  is  executed  by  a  robotic  network.  These  notions  subsume 
the  notions  of  synchronous  network  and  distributed  algorithm  described  in 
Section  1.5.  As  an  example  of  these  notions,  we  introduce  a  simple  law, 
called  the  agree  and  pursue  law,  which  combines  ideas  from  leader  election 
algorithms  and  from  cyclic  pursuit  (i.e. ,  a  game  in  which  robots  chase  each 
other  in  a  circular  environment).  In  the  second  section  of  the  chapter,  we 
propose  a  model  of  groups  of  robots  that  interact  through  sensing,  rather 
than  communication.  The  third  section  of  the  chapter  discusses  time,  space, 
and  communication  complexity  notions  for  robotic  networks  as  extensions 
of  the  corresponding  notions  for  distributed  algorithms.  The  complexity  no¬ 
tions  rely  on  the  basic  concept  of  coordination  task  and  task  achievement. 
The  fourth  and  last  section  of  the  chapter  establishes  the  time,  space,  and 
communication  required  by  the  agree  and  pursue  law  to  steer  a  group  of 
robots  to  a  uniformly-spaced  rotating  configuration.  We  end  the  chapter 
with  three  sections  on,  respectively,  bibliographical  notes,  proofs  of  the  re¬ 
sults  presented  in  the  chapter,  and  exercises. 


3.1  A  MODEL  FOR  SYNCHRONOUS  ROBOTIC  NETWORKS 

Here  we  introduce  a  model  for  a  synchronous  robotic  network.  This  model  is 
an  extension  of  the  synchronous  network  model  in  Section  1.5.1.  We  start  by 
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detailing  the  physical  components  of  the  network,  which  include  the  robots 
themselves  as  well  as  the  communicate  service  among  them. 


3.1.1  Physical  components 

Let  us  start  by  providing  a  basic  definition  of  robot  and  a  model  for  how 
each  robot  moves  in  space. 

A  mobile  robot  is  a  continuous-time  continuous-space  dynamical  system 
as  defined  in  Section  1.3,  that  is,  a  tuple  (X,U,  Xq,  f),  where 

(i)  X  is  d-dimensional  space  chosen  among  M.d ,  §rf,  and  the  Cartesian 
products  McZl  x  S^2,  for  some  d\  +  d2  =  d,  called  the  state  space ; 

(ii)  U  is  a  compact  subset  of  Rm  containing  0m,  called  the  input  space ; 

(iii)  Xq  is  a  subset  of  X ,  called  the  set  of  allowable  initial  states ; 

(iv)  /  :  X  x  U  — >  is  a  continuously  differentiable  control  vector  field 

on  X,  that  is,  /  determines  the  robot  motion  x  :  M>o  — ►  X  via  the 
differential  equation,  or  control  system, 

x(t)  =  f(x(t),u(t)),  (3.1.1) 

subject  to  the  control  u  :  M>o  — >  U. 

We  will  use  the  terms  robot  and  agent  interchangeably.  We  refer  toi£l 
and  u  G  U  as  a  physical  state  and  an  input  of  the  mobile  robot,  respectively. 
Most  often  the  physical  state  will  have  the  interpretation  of  a  location,  or  a 
location  and  velocity.  We  will  often  consider  control-affine  vector  fields.  In 
such  a  case,  we  represent  /  as  the  ordered  family  of  continuously  differen¬ 
tiable  vector  fields  (/o,  fi, . . . ,  fm)  on  X.  In  general  the  control  signal  u  will 
not  depend  only  on  time  but  also  on  x  and  possible  other  variables  in  the 
system.  Note  that  there  is  no  additional  difficulty  in  modeling  mobile  robots 
using  dynamical  systems  defined  on  manifolds  [Bullo  and  Lewis,  2004],  but 
we  avoid  it  here  in  the  interest  of  simplicity. 

Example  3.1  (Planar  vehicle  models).  The  following  models  of  control 
systems  are  commonly  used  in  robotics,  beginning  with  the  early  works 
in  [Dubins,  1957,  Reeds  and  Shepp,  1990].  Figure  3.1(a)  and  (b)  show  a  two¬ 
wheeled  vehicle  and  a  four-wheeled  vehicle,  respectively.  The  two-wheeled 
planar  vehicle  is  described  by  the  dynamical  system: 

x  =  vcos9,  y  =  vsin9,  6  =  uj,  (3.1.2) 

with  state  variables  x  E  R,  y  E  M  and  9  E  S1,  describing  the  planar  position 
and  orientation  of  the  vehicle,  and  with  controls  v  and  oj,  describing  the 
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forward  linear  velocity  and  the  angular  velocity  of  the  vehicle.  Depending 


(a)  (b) 

Figure  3.1  (a)  Two- wheeled  vehicle  and  (b)  four-wheeled  vehicle.  In  each  case,  the  orien¬ 
tation  of  the  vehicle  is  indicated  by  the  small  triangle. 

on  what  set  the  controls  are  restricted  to,  we  define  the  models: 


The  unicycle.  The  controls  v  and  uj  take  value  in  [—1,1]  and  [—1, 1],  re¬ 
spectively. 

The  differential  drive  robot.  Set  v  =  (o;right  +  wieft)/2  and  uj  =  (a;right  — 
wieft )  /2  and  assume  both  curight  and  wieft  take  value  in  [—1, 1], 

The  Reeds— Shepp  car.  The  control  v  takes  values  in  {—1,0, 1}  and  the 
control  uj  takes  values  in  [—1,1]. 

The  Dubins  vehicle.  The  control  v  is  set  equal  to  1  and  the  control  uj 
takes  value  in  [—1, 1], 


Finally,  the  four-wheeled  planar  vehicle,  composed  of  a  rear  and  a  front  axle 

separated  by  a  distance  £,  is  described  by  the  same  dynamical  system  (3.1.2) 

with  the  following  distinctions:  ( x ,  y)  E  K2  is  the  position  of  the  midpoint 

of  the  rear  axle,  6  E  S1  is  the  orientation  of  the  rear  axle,  the  control  v  is 

the  forward  linear  velocity  of  the  rear  axle  and  the  angular  velocity  satisfies 
v 

uj  =  —  tan  cf>,  where  the  control  (f>  is  the  steering  angle  of  the  vehicle.  • 

The  following  definition  is  a  generalization  of  the  concept  of  synchronous 
network  introduced  in  Definition  1.35. 

Definition  3.2  (Robotic  network).  The  physical  components  of  a  robotic 
network  S  consist  of  a  tuple  (1, 1Z,  Ecmm),  where 

(i)  I  =  {1, . . . ,  n};  I  is  called  the  set  of  unique  identifiers  ( UIDs ); 
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(ii)  1Z  =  {i?W}je/  =  {(XW,  f/M,  X^, /W)}ie/  is  a  set  of  mobile  robots; 

(iii)  ficmm  is  a  map  from  Re/  X i'i  to  the  subsets  of  /  x  /;  this  map  is 
called  the  communication  edge  map. 

If  /?[*]  =  (X,  17,  Xq,  /)  for  all  i,  then  the  robotic  network  is  called  uniform .• 


Remarks  3.3  (Notational  conventions  and  meaning  of  the  commu¬ 
nication  edge  map). 

(i)  Following  the  convention  established  in  Section  1.5,  we  let  the  su¬ 
perscript  [z]  denote  the  variables  and  spaces  which  correspond  to  the 
robot  with  unique  identifier  z;  for  instance,  xM  E  XM  and  E  X^ 
denote  the  physical  state  and  the  initial  physical  state  of  robot  , 
respectively.  We  refer  to  x  =  (xR . . . ,  x^)  E  fPe/  xw  as  a  state 
of  the  network. 

(ii)  The  map  x  i— >  (I,  Ecmm(x))  models  the  topology  of  the  communica¬ 
tion  service  among  the  robots:  at  a  physical  state  x  =  (xM, . . . ,  x^), 
two  robots  at  locations  x  M  and  x^  can  communicate  if  and  only  if 
the  pair  (z.  j)  is  an  edge  in  Ecmm(x)  =  Ecmmix^. . . .  ,  x^).  Accord¬ 
ingly,  we  refer  to  (I,Ecmm(x))  as  the  communication  graph  at  x. 
When  and  what  robots  communicate  is  discussed  in  Section  3.1.2. 

As  communication  graphs,  we  will  often  adopt  one  of  the  proximity 
graphs  discussed  in  Section  2.2  and  in  particular  the  (undirected) 
disk  graph.  • 

To  make  things  concrete,  let  us  present  some  examples  of  robotic  networks 
that  will  be  commonly  used  later. 

Example  3.4  (First-order  robots  with  range-limited  communica¬ 
tion).  Consider  a  group  of  robots  moving  in  Md,  d  >  1.  As  in  Chapter  2, 
we  let  p  denote  a  point  in  and  we  let  {pi1! , . . . ,  p^ }  denote  the  robot 
locations.  Assume  the  robots  move  according  to 

pM(t)  =  uW(t),  (3.1.3) 

with  yW  g  [—umax,  umax]-  See  Figure  3.2  for  an  illustration.  According  to 
our  mobile  robot  notation,  these  are  identical  robots  of  the  form 

(Rd,  [-umax,  umax]d,  Md,  (0d,  ei, . . . ,  ed)). 

We  assume  that  each  robot  can  sense  its  own  position  and  can  communicate 
to  any  other  robot  within  distance  r,  that  is,  we  adopt  the  r-disk  graph 
t/disk)7")  defined  in  Section  2.2  as  communication  graph.  These  data  define 
the  uniform  robotic  network  5disk- 
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Figure  3.2  Omnidirectional  vehicle.  In  addition  to  controlling  the  rotation  speed  of  the 
wheels,  the  vehicle  can  also  actuate  the  direction  in  which  they  point.  This 
allows  the  vehicle  to  move  in  any  direction  according  to  the  first-order  dynam¬ 
ics  (3.1.3). 


It  shall  also  be  interesting  to  consider  first-order  robots  with  communi¬ 
cation  graphs  other  than  the  disk  graph;  important  examples  include  the 
Delaunay  graph  fo,  the  limited  Delaunay  graph  £/ld(0>  and  the  oo-disk 
graph  1/ oo-disk (i1 )?  discussed  in  Section  2.2.  These  three  graphs,  adopted  as 
communication  models,  give  rise  to  three  robotic  networks  denoted  5ld> 
Soo-disk,  respectively.  • 

Example  3.5  (Planar  vehicle  robots  with  Delaunay  communica¬ 
tion).  We  consider  a  group  of  vehicle  robots  moving  in  an  allowable  envi¬ 
ronment  Q  C  R2  according  to  the  planar  vehicle  dynamics  introduced  in 
Example  3.1.  We  let  {(pM,  #M), . . . ,  (pM,  #["!)}  denote  the  robot  physical 
states,  where  pM  =  (x'W, yW)  e  Q  corresponds  to  the  position  and  g  S1 
corresponds  to  the  orientation  of  the  robot  i  €  /.  As  communication  graph, 
we  adopt  the  Delaunay  graph  Qd  on  Q  introduced  in  Section  2.2.  These 
data  define  the  uniform  robotic  network  5vehicies-  • 

Example  3.6  (First-order  robots  with  line-of-sight  communication). 

We  consider  a  group  of  robots  moving  in  an  allowable  environment  Q  C  K2. 
As  in  Example  3.4,  we  let  {pW, . . .  ,p^}  denote  the  robot  locations  and  as¬ 
sume  the  robots  move  according  to  the  motion  model  (3.1.3).  Each  robot 
can  sense  its  own  position,  the  boundary  of  dQ,  and  can  communicate  to 
any  other  robot  within  distance  r  and  within  line  of  sight,  that  is,  we  adopt 
the  range-limited  visibility  graph  f/vis-disk,Q  in  Q  defined  in  Section  2.2  as 
the  communication  graph.  These  data  define  the  uniform  robotic  network 

‘-^vis-disk  •  * 

Example  3.7  (First-order  robots  in  S1).  Consider  n  robots  {0M, . . . ,  9^} 
in  S1 ,  moving  along  on  the  unit  circle  with  angular  velocity  equal  to  the  con- 
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trol  input.  Each  robot  is  described  by  the  tuple  (S1,  [—Umax;  Mmavj.S1,  (0,  e)), 
where  e  is  the  vector  field  on  S1  describing  unit-speed  counterclockwise  ro¬ 
tation.  As  in  the  previous  examples,  we  assume  that  each  robot  can  sense 
its  own  position  and  can  communicate  to  any  other  robot  within  distance 
r  along  the  circle,  that  is,  we  adopt  the  r-disk  graph  <7diskM  on  S1  defined 
in  Section  2.2  as  the  communication  graph.  These  data  define  the  uniform 
robotic  network  5circie-  • 

We  conclude  this  section  with  a  remark. 

Remark  3.8  (Congestion  models  in  robotic  networks).  The  behav¬ 
ior  of  a  robotic  network  might  be  affected  by  communication  and  physical 
congestion  problems. 


Communication  congestion:  Omnidirectional  wireless  transmissions  in¬ 
terfere.  Clear  reception  of  a  signal  requires  that  no  other  signals  are 
present  at  the  same  point  in  time  and  space.  In  an  ad  hoc  network, 
node  i  receives  a  message  transmitted  by  node  j  only  if  all  other  neigh¬ 
bors  of  i  are  silent.  In  other  words,  the  transmission  medium  is  shared 
among  the  agents.  As  the  density  of  agents  increases,  so  does  wireless 
communication  congestion.  Asymptotic  and  optimization  results  are 
known  on  this  regard. 

First,  for  ad  hoc  networks  with  n  uniformly  randomly  placed  nodes, 
it  is  known  [Gupta  and  Kumar,  2000]  that  the  maximum-throughput 
communication  range  r(n)  of  each  node  decreases  as  the  density  of 
nodes  increases;  in  d  dimensions  the  appropriate  scaling  law  is  r(n)  E 
@((log(n)/n)1//rf).  This  is  referred  to  as  the  connectivity  regime  in 
percolation  theory  and  statistical  mechanics.  Using  the  /c-nearest- 
neighbor  graph  over  uniformly  placed  nodes,  the  analysis  in  [Xue  and  Kumar, 
2004]  suggests  that  the  minimal  number  of  neighbors  in  a  connected 
network  grows  with  log(n). 

Second,  a  growing  body  of  literature  [Santi,  2005,  Lloyd  et  ah,  2005] 
is  available  on  topology  control ,  i.e. ,  on  how  to  compute  transmission 
power  values  in  an  ad  hoc  network  so  as  to  minimize  energy  consump¬ 
tion  and  interference  (due  to  multiple  sources) ,  while  achieving  various 
graph  topological  properties,  such  as  connectivity  or  low  network  di¬ 
ameter. 

Physical  congestion.  Robots  can  collide:  it  is  clearly  important  to  avoid 
“simultaneous  access  to  the  same  physical  area”  by  multiple  robots. 

It  is  reasonable  to  assume  that,  as  the  number  of  robots  increase,  so 
should  the  area  available  for  their  motion.  An  alternative  convenient 
approach  is  the  one  taken  by  Sharma  et  al.  [2007],  where  robots’  safety 
zones  decrease  with  decreasing  robots’  speed.  This  suggests  that,  in  a 
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fixed  environment,  individual  nodes  of  a  large  ensemble  have  to  move 
at  a  speed  decreasing  with  n,  and  in  particular,  at  a  speed  proportional 
to  Roughly  speaking,  if  the  overall  volume  V  where  the  groups 

of  agents  move  is  constant,  and  there  are  n  robots,  then  the  speed  v 
they  can  move  at  goes  approximately  as  vd  «  ^ . 

In  summary,  one  way  to  incorporate  congestion  effects  into  the  robotic 
network  model  is  to  assume  that  the  parameters  of  the  physical  components 
of  the  network  depend  upon  the  number  of  robots  n.  In  the  limit  as  n  — *  +oo 
we  will  sometimes  assume  that  r  and  um ax,  the  communication  range  and 
the  velocity  upper  bound  in  Examples  3.4  and  3.7,  are  of  order  n ~1/d.  • 


3.1.2  Control  and  communication  laws 

Here  we  present  a  discrete-time  communication,  continuous-time  motion 
model  for  the  evolution  of  a  robotic  network  subject  to  a  communication 
and  control  law.  In  our  model,  each  robot  evolves  in  the  physical  domain 
in  continuous  time,  senses  its  position  in  continuous  time,  and,  in  discrete 
time,  exchanges  information  with  other  robots  and  executes  a  state  machine, 
which  we  shall  refer  to  as  a  processor.  The  following  definition  is  a  general¬ 
ization  of  the  concept  of  distributed  algorithm  introduced  in  Definition  1.36 
and  of  the  classical  notion  of  dynamical  feedback  controller. 

Definition  3.9  (Control  and  communication  law).  A  control  and  com¬ 
munication  law  CC  for  a  robotic  network  S  consists  of  the  sets: 

(i)  A,  a  set  containing  the  null  element,  called  the  communication 
alphabet ;  elements  of  A  are  called  messages ; 

(ii)  W®,  i  S  /,  called  the  processor  state  sets ; 

(iii)  W'f  C  lew ,  i  e  I,  sets  of  allowable  initial  values ; 

and  of  the  maps: 

(i)  msgW  :  X®  x  W®  x  /  — >  A,  i  G  /,  called  message-generation  func¬ 
tions ; 

(ii)  stf'W  :  xM  x  W®  x  An  — »  W^\  i  G  I,  called  (processor)  state- 
transition  functions ; 

(iii)  ctl^  :  xM  x  X^  x  x  An  — >  t/M,  j  g  J]  called  (motion)  control 
functions. 

If  S  is  uniform  and  if  =  W,  msgW  =  msg,  stf^  =  stf,  ctl^  =  ctl, 
for  all  i  €  /,  then  CC  is  said  to  be  uniform  and  is  described  by  a  tuple 
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(A,  W,  {W^}ieI,  msg,  stf,  ctl). 


We  will  sometimes  refer  to  a  control  and  communication  law  as  a  dis¬ 
tributed  motion  coordination  algorithm.  Roughly  speaking,  the  rationale 
behind  Definition  3.9  is  the  following,  see  Figure  3.3.  The  state  of  robot  i 


f 

V 


Update 

processor 

state 


A 

J 


Update  physical  state 


Figure  3.3  Execution  of  a  control  and  communication  law  by  a  robotic  network. 

includes  both  the  physical  state  and  the  processor  state  w W  £ 

of  the  state  machine  that  robot  i  implements.  These  states  are  initialized 
with  values  in  their  corresponding  allowable  initial  sets  A7^  and  .  We 
assume  the  robot  can  sense  it  own  physical  position  x'W .  At  each  time  instant 
i  £  Z>o,  robot  i  sends  to  each  of  its  out-neighbors  j  in  the  communication 
digraph  (/,  F?cmm(x))  a  message  (possibly  the  null  message)  computed  by 
applying  the  message-generation  function  msg^  to  the  current  values  of  its 
physical  state  x^ ,  processor  state  and  to  the  identity  j.  Subsequently, 
but  still  at  the  time  instant  l  £  Z>o,  robot  i  updates  the  value  of  its  proces¬ 
sor  state  w W  by  applying  the  state-transition  function  stf^  to  the  current 
value  of  its  physical  state  x'M ,  processor  state  7i,’W  and  to  the  messages  it 
receives  from  its  in-neighbors.  Between  communication  instants,  i.e.,  for 
t  £  [1,1+1)  for  some  t  £  Z>o,  the  motion  of  the  ith  robot  is  determined  by 
applying  the  control  function  to  the  current  value  of  x^\  the  value  of  x M  at 
time  £ ,  the  current  value  of  w  W ,  and  the  messages  received  at  time  i.  This 
evolution  model  is  very  similar  to  the  one  we  introduced  for  synchronous 
networks  in  Definition  1.37:  at  each  communication  round,  the  first  step 
is  transmission  and  the  second  one  is  computation  and,  except  for  the  de¬ 
pendence  on  the  physical  state  x,  the  communication  and  state  transition 
processes  are  identical. 

These  ideas  are  formalized  in  the  following  definition. 
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Definition  3.10  (Evolution  of  a  robotic  network).  Let  CC  be  a  control 

and  communication  law  for  the  robotic  network  S.  The  evolution  of  ( S,CC ) 

m  \i]  \i]  \i I 

from  initial  conditions  Xq  E  Xq  and  U!q  E  Wq  ,  i  E  /,  is  the  collection  of 
curves  x^  :  M>o  — >  X^  and  tcM  :  Z>o  — ■*  W^\  i  E  /,  defined  by 

iW(t)  =  /(*%),  ctl W (a: W (t) , a; W ( ^ J ) , «; W ( LtJ ) , 2/ W ( UJ )) ) , 

where  |_tj  =  max{f  6  Z>o  |  l  <  t},  and 

w®{£)  =  stfW(xW(f),u;W(^-  1  ),y[i](£)), 


with  x^(0)  =  x[^,  and  u>M(—  1)  =  ?Xq  ,  i  E  /.  In  the  previous  equations,  yM  : 
Z>o  — >  An  (describing  the  messages  received  by  processor  i )  has  components 
(£) .  for  j  E  /,  defined  by 


fmsgW(xW(£),u;W(^-  l),i),  if  (j,i)  e  Ecmm(x^ (£),..., x^ (£)) , 
[null,  otherwise. 


For  convenience,  we  define  rr(f)  =  rc(L^J)  for  all  t  E  M>o,  and  let  M>o  3 
1 1— >  (x(t),w(t))  denote  the  curves  xM  and  u;W,  for  i  E  {1, . . . ,  n}. 


Remarks  3.11  (Simplifications  of  control  and  communication  laws). 

(i)  A  control  and  communication  law  CC  is  static  if  the  processor  state 
set  WM  is  a  singleton  for  all  i  €  I.  This  means  that  there  is  no 
meaningful  evolution  of  the  processor  state.  In  this  case  CC  can  be 
described  by  a  tuple  (A,  {msgM}je/,  {ctlM}ie/)>  with  rnsgM  :  XM  x 
I  —>  A,  and  ctl[i]  :  X®  x  X^  x  A”  — >  U®,  for  i  E  /; 

(ii)  A  control  and  communication  law  CC  is  data-sampled  if  the  control 
functions  are  independent  of  the  current  position  of  the  robot  and 
depend  only  upon  the  robot  position  at  last  sample  time.  Specif¬ 
ically,  the  control  functions  have  the  following  property:  given  a 
processor  state  w M  E  W^,  an  array  of  messages  y M  E  An,  a  current 
state  x'M,  and  a  state  at  last  sample  time  x^  ld,  the  control  input 

ctl^(xW,  Xg^pjj,  u;W,  yW)  is  independent  of  x^,  for  all  iEl.  In  this 
case  the  control  functions  can  be  described  by  maps  of  the  form 
ctl[i]  :  XM  x  W®  xA”-4  J/M,  for  *  E  /. 

(iii)  In  many  control  and  communication  laws,  the  robots  exchange  their 
states,  including  both  their  processor  and  their  physical  states. 
For  such  laws,  we  identify  the  communication  alphabet  with  A  = 
(X  x  W)  Ujnull}  and  we  refer  to  the  corresponding  message  gen- 
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eration  function  msgstd(x,  w,  j)  =  (x,w)  as  the  standard  message- 
generation  function. 

Note  that  we  allow  the  processor  state  set  and  the  communication  alphabet 
to  contain  an  infinite  number  of  symbols.  In  other  words,  we  assume  that  a 
robot  can  store  and  transmit  a  (finite  number  of)  integer  and  real  numbers, 
among  other  things.  This  is  equivalent  to  assuming  that  we  neglect  any 
inaccuracies  due  to  quantization,  as  we  did  in  Section  1.6.  • 

Remark  3.12  (Extensions  of  control  and  communication  laws). 

Here  we  briefly  discuss  alternative  models  and  extensions  of  the  proposed 
models. 


Asynchronous  sensor-based  interactions.  In  the  network  model  pro¬ 
posed  by  Suzuki  and  Yamashita  [1999],  robots  are  referred  to  as  “anony¬ 
mous”  and  “oblivious”  in  precisely  the  same  way  in  which  we  defined 
control  and  communication  laws  to  be  uniform  and  static,  respec¬ 
tively.  As  compared  with  our  notion  of  robotic  network,  the  model 
in  [Suzuki  and  Yamashita,  1999]  is  more  general  in  that  the  robots’ 
activations  schedules  do  not  necessarily  coincide  (i.e. ,  this  model  is 
asynchronous),  and  at  the  same  time  it  is  less  general  in  that  (1) 
robots  cannot  communicate  any  information  other  than  their  respec¬ 
tive  positions,  and  (2)  each  robot  observes  every  other  robot’s  position 
(i.e.,  the  complete  communication  graph  is  adopted).  In  the  next  Sec¬ 
tion  3.2  we  present  a  model  where  robots  rely  on  sensing  rather  than 
communication  for  their  interaction. 

Discrete  and  continuous  time  in  motion  and  communication.  In  some 
cases  it  will  be  convenient  to  consider  discrete-time  motion  models;  for 
example,  we  present  discrete-time  motion  models  for  first  order  agents 
in  Section  4.1.  In  some  other  cases,  it  will  be  convenient  to  consider 
dynamical  interactions  between  agents  taking  place  in  continuous  time. 

Stochastic  link  models.  Although  we  do  not  present  any  result  on  this 
topic  in  this  notes,  it  is  possible  to  develop  robotic  networks  mod¬ 
els  over  random  graphs  and  random  geometric  graphs,  as  studied 
by  Bollobas  [2001],  Penrose  [2003].  Furthermore,  it  is  of  interest  to 
consider  communication  links  with  time- varying  rates.  • 


3.1.3  Agree  and  pursuit  control  and  communication  law 

We  conclude  this  section  with  an  example  of  a  dynamic  control  and  com¬ 
munication  law.  The  problem  is  described  as  follows:  a  collection  of  robots 
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with  range-limited  communication  are  placed  on  the  unit  circle;  the  robots 
move  and  communicate  with  the  objectives  of  (1)  agreeing  on  a  direction  of 
motion  (clockwise  or  counterclockwise)  and  (2)  achieving  an  equidistant  con¬ 
figuration  where  all  robots  are  equally  angularly-spaced.  To  achieve  these 
two  objectives,  we  combine  ideas  from  leader  election  algorithms  for  syn¬ 
chronous  networks  (see  Section  1.5.4)  and  from  cyclic  pursuit  problems  (see 
Exercise  El. 29):  the  robots  move  a  distance  proportional  to  an  appropriate 
inter-robot  separation,  and  they  repeatedly  compare  their  identifiers  to  dis¬ 
cover  the  direction  of  motion  of  the  robot  with  the  largest  identifier.  In  other 
words,  the  robots  run  a  leader  election  task  in  their  processor  states  and  a 
uniform  robotic  deployment  task  in  their  physical  state  —  these  are  among 
the  most  basic  tasks  in  distributed  algorithms  and  cooperative  control.  We 
present  the  algorithm  here  and  characterize  its  correctness  and  performance 
later  in  the  chapter. 

From  Example  3.7,  we  consider  the  uniform  network  5circic  of  locally- 
connected  first-order  robots  on  S1.  For  r,  rtmax,  kprop  £  ] 0,  |[  with  kpmpr  < 
Mmm,  we  define  the  AGREE  &  pursue  law,  denoted  by  CCAGREE  &  pursue,  as 
the  uniform  data-sampled  law  loosely  described  as  follows: 


[Informal  description]  The  processor  state  consists  of  dir  (the 
robot’s  direction  of  motion)  taking  values  in  {c,  cc}  (meaning 
clockwise  and  counterclockwise)  and  max- id  (the  largest  UID 
received  by  the  robot,  initially  set  to  the  robot’s  UID)  taking 
values  in  I.  At  each  communication  round,  each  robot  transmits 
its  position  and  its  processor  state.  Among  the  messages  received 
from  agents  moving  towards  its  position,  each  agent  picks  the 
message  with  the  largest  value  of  max-id.  If  this  value  is  larger 
than  its  own  value,  the  agent  resets  its  processor  state  with  the 
selected  message.  Between  communication  rounds,  each  robot 
moves  in  the  clockwise  or  counterclockwise  direction  depending 
on  whether  its  processor  state  dir  is  c  or  cc.  Each  robot  moves 
fcprop  times  the  distance  to  the  immediately  next  neighbor  in  the 
chosen  direction,  or,  if  no  neighbors  are  detected,  kprop  times  the 
communication  range  r. 


Note  that  the  processor  state  with  the  largest  UID  will  propagate  through¬ 
out  the  network  as  in  the  floodmax  algorithm  for  leader  election.  Also, 
note  that  the  assumption  kpmpr  <  ttmax  guarantees  that  the  desired  control 
is  always  within  the  allowable  range  «m«].  Next,  we  define  the  law 

formally. 


Robotic  Network:  5circie,  first-order  agents  in  S1 
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with  absolute  sensing  of  own  position,  and 
with  communication  range  r 

Distributed  Algorithm:  AGREE  &  PURSUE 
Alphabet:  A  =  S1  x  {c,  cc}  x  I  U{null} 

Processor  State:  w  =  (dir, max-id),  where 

dir  E{c,cc},  initially:  dirW  unspecified 

max- id  E  /,  initially:  max- id M  =  i  for  all  i 

%  Standard  message  generation  function 
function  msg(0,  w,  i ) 
i:  return  ( 9 ,  w) 

function  stf(0,  w,  y ) 

i:  for  each  non-null  message  (#rcvd,  (dirrcvd,  max-idrcvd))  in  y  do 
2:  if  (max-idrcvd  >  max-id)  AND  (distcc(0,  0rcvd)  <  r  AND  dirrcv<j  = 

c)  OR  (distc(0,  0rcvd)  <  r  AND  dirrcvd  =  cc)  then 
3:  new-dir  :=  dirrcvd 

4:  new-id  :=  max-idrcvd 

5:  return  (new-dir,  new-id) 

function  ctl(0smpid,  w,  y) 

^tmp  • —  f 

2:  for  each  non-null  message  ($rcvd>  (dirrcvd,max-idrcvd))  in  y  do 
3:  if  (dir  —  cc)  AND  (distcc(0smpid,  drcvd)  ^  <4mp)  then 

4:  dtmp  :=  dist  cc  (^smpld  j  $rcvd) 

5:  'Amp  • —  fcpropdtmp 

6:  if  (dir  —  c)  AND  (distc(0smpid,  drcvd)  ^  dtmp')  then 

7-  duTip  • —  dist c  ( 0Smpld )  $rcvd  ) 

Amp  . —  ^propdtmp 
9:  return  Utmp 


An  implementation  of  this  control  and  communication  law  is  shown  in 
Figure  3.4.  As  parameters  we  select  n  =  45,  r  =  27t/40,  umax  =  1/4  and 
fcprop  =  7/16.  Along  the  evolution,  all  robots  agree  upon  a  common  direction 
of  motion  and,  after  suitable  time,  they  reach  a  uniform  distribution. 

3.2  ROBOTIC  NETWORKS  WITH  RELATIVE  SENSING 

The  model  presented  above  assumes  the  ability  of  each  robot  to  know  its  own 
absolute  position.  Here,  we  treat  the  alternative  setting  in  which  the  robots 
do  not  communicate  amongst  themselves,  but  instead  detect  and  measure 
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Figure  3.4  The  AGREE  &  PURSUE  law.  Disks  and  circles  correspond  to  robots  moving 
counterclockwise  and  clockwise,  respectively.  The  initial  positions  and  the 
initial  directions  of  motion  are  randomly  generated.  The  five  pictures  depict 
the  network  state  at  times  0,  9,  20,  100,  and  800. 


each  other’s  relative  position  through  appropriate  sensors.  Additionally, 
we  assume  that  the  robots  will  perform  measurements  of  the  environment 
without  having  any  a  priory  knowledge  of  it.  We  assume  that  robots  do  not 
have  the  ability  to  perform  measurements  expressed  in  a  common  reference 
frame.  An  early  reference  where  relative  information  is  adopted  is  [Lin  et  al., 
2005], 


3.2.1  Kinematics  notions 


Because  robots  do  not  have  a  common  reference  frame,  all  the  measure¬ 
ments  generated  by  their  on-board  sensors  are  expressed  in  a  local  reference 
frame.  To  formalize  this  fact,  it  is  useful  to  review  some  basic  kinematics 
conventions.  We  let  Xfixed  =  (pfixed;  |a;fixed,  yfixed,  zfixed})  be  a  fixed  refer¬ 
ence  frame  in  M3.  A  point  q,  a  vector  v,  and  a  set  of  points  S  expressed 
with  respect  to  the  frame  Efixed  are  denoted  by  qaxed,  Ufixed  and  Sfixed,  re¬ 
spectively.  Next,  let  Eb  =  (pb,  {a;b,  y‘\  z°})  be  a  reference  frame  fixed  with 


Figure  3.5  Inertially-fixed  and  body-fixed  frames  in  R3 

a  moving  body.  The  origin  of  £b  is  the  point  ph,  denoted  by  pdxed  when 
expressed  with  respect  to  £fixed.  The  orientation  of  Xb  is  characterized  by 
the  d-dimensional  rotation  matrix  Rgxcd,  whose  columns  are  the  frame  vec- 
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tors  {ccb,2/b,zb}  of  Eb  expressed  with  respect  to  Efixcd.  We  recall  here  the 
definition  of  the  group  of  rotation  matrices  in  d-dimensions 

SO (d)  =  {RG  Rdxd  |  RRt  =  Id ,  det (R)  =  +1}. 

With  these  notations,  changes  of  reference  frames  are  described  by 

(/fixed  =  -RfixedQb  +  PfLed; 

^fixed  -^fixed^tn 

‘S'fixed  =  -^fixed^b  +  Pfixed’  (3.2.1) 

Note  that  these  change  of  frames  formulas  also  hold  in  the  planar  case  with 
the  corresponding  definition  of  rotation  matrix  in  SO (2). 

Remark  3.13  (Comparison  with  literature).  In  our  notation  the  sub¬ 
script  denotes  the  frame  with  respect  to  which  the  quantity  is  expressed. 
Other  references  in  the  literature  sometimes  adopt  the  opposite  convention 
in  which  the  superscript  denotes  the  frame  with  respect  to  which  the  quan¬ 
tity  is  expressed.  • 


3.2.2  Physical  components 

In  what  follows  we  describe  our  notion  of  mobile  robots  equipped  with  relative 
sensors.  We  consider  a  group  of  n  robots  moving  in  an  allowable  environ¬ 
ment  Q  C  Rd,  for  d  E  {2,3},  and  we  assume  that  a  reference  frame  E^, 
for  i  €  {1, . . . ,  n},  is  attached  to  each  robot,  see  Figure  3.6.  Expressed  with 
respect  to  the  fixed  frame  Efixed,  the  ?'th  frame  EM  is  described  by  a  position 
Infixed  ^  d  and  an  orientation  E  SO  (d).  The  continuous-time  motion 

and  discrete-time  sensing  models  are  described  as  follows. 

Motion  model:  We  select  a  simple  motion  model:  for  all  t  E  M>o,  the 

M  — 

orientation  is  constant  in  time  and  robot  i  translates  according  to 

#LdW  =  4‘Lxf'.  (5.2.2) 

\i]  . 

that  is,  the  zth  control  input  u\  is  known  and  applied  in  the  robot  frame. 
Each  control  input  uf\  i  E  {l,...,n},  takes  values  in  a  compact  input 
space  U.  Clearly,  it  would  be  possible  to  consider  a  motion  model  with 
time- varying  orientation  and  we  refer  the  reader  to  Exercise  E3.1  where  we 
do  so. 

Sensing  model:  At  each  discrete  time  instant,  robot  i  activates  a  sensor 
that  detects  the  presence  and  returns  a  measurement  about  the  relative  po¬ 
sition  of  any  object  (robots  or  environment  boundary)  inside  a  given  “sensor 
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Figure  3.6  Robotic  network  with  relative  sensing.  A  group  of  4  robots  moves  in  R2.  Each 
robot  i  €  {1, ...  ,4}  has  its  own  reference  frame  E  ^ . 


footprint.”  We  describe  the  model  in  two  steps.  First,  each  robot  measures 


Figure  3.7  Examples  of  sensor  footprints,  (a)  Cone-shaped  sensor  footprint  of  a  vehicle 
equipped  with  a  camera,  (b)  270-degree  wedge-shaped  sensor  footprint  of  a 
vehicle  equipped  with  a  laser  scanner. 


other  robots’  positions  and  the  environment  as  follows. 


Sensing  other  robots’  positions.  There  exists  a  set  Arbt  containing  the 
null  element,  called  the  sensing  alphabet ,  and  a  map  rbt-sns  :  — » 

Arbt,  called  the  sensing  function ,  with  the  interpretation  that  robot  i 
acquires  the  symbol  rbt-sns(pj^)  E  Arbt  for  each  robot  j  E  {1, . . . ,  n}  \ 
»■ 

Sensing  the  environment.  There  exists  a  set  Aenv  containing  the  null 
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element,  called  the  environment  sensing  alphabet ,  and  a  map  env-sns  : 
P(Mrf)  — ►  Aenv,  called  the  environment  sensing  function,  with  the  in¬ 
terpretation  that  robot  i  acquires  the  symbol  env-sns(Qj)  G  Aenv. 


Second,  we  let  C  be  the  sensor  footprint  of  robot  i  and  be  its 
expression  in  the  frame  S  W .  see  Figure  3.7.  For  simplicity,  we  assume  that  all 
robot  sensors  are  equal,  so  that  we  write  S\  =  S.  We  require  both  sensing 
functions  to  provide  no  information  about  robots  and  boundaries  that  are 
outside  S  in  the  following  two  meanings:  (i)  if  p  is  any  point  outside  S, 
then  rbt-sns(p)  =  null,  and  (ii)  if  W  is  any  subset  of  Mrf,  env-sns  (IF)  = 
env-sns(lF  n  S ). 

We  summarize  this  discussion  with  the  following  definition. 

Definition  3.14  (Network  with  relative  sensing).  The  physical  com¬ 
ponents  of  a  network  with  relative  sensing  consist  of  n  mobile  robots  with 
identifiers  {1, . . .  ,n},  with  configurations  in  Q  x  SO (d),  for  an  allowable  en¬ 
vironment  Q  C  Mrf,  with  dynamics  described  by  equation  (3.2.2),  and  with 
relative  sensors  described  by  the  sensor  footprint  S,  sensing  alphabets  Arbt 
and  Aenv,  and  sensing  functions  rbt-sns  and  env-sns.  • 

To  make  things  concrete,  let  us  present  two  examples  of  robotic  networks 
with  relative  sensing  that  are  analogs  of  the  “communication-based”  robotic 
networks  5disk  and  5vis-disk  in  Examples  3.4  and  3.6. 

Example  3.15  (Disk  sensor  and  corresponding  relative-sensing  net¬ 
work).  Given  a  sensing  range  r  G  M>o,  the  disk  sensor  has  sensor  footprint 
B(0d,r),  i.e. ,  a  disk  sensor  measures  any  object  (robot  and  environment 
boundary)  within  distance  r.  Regarding  sensing  of  other  robots,  we  assume 
that  the  alphabet  is  Arbt  =  M d  Ujnull}  and  that  the  sensing  function  is 
rbt-sns  (pj-^ )  =  p\^  for  each  robot  j  G  {1, . . . ,  n}  \  {*},  inside  the  sensor  foot¬ 
print  B(0d,r),  and  rbt-sns (/j^)  =  null,  otherwise.  Regarding  sensing  of 
the  environment,  we  assume  that  the  alphabet  is  Aenv  =  P(Krf)  and  that  the 
sensing  function  is  env-sns  (Qf)  =  Qi  n  B(0d,  r).  A  group  of  robots  with  disk 
sensors  defines  the  robotic  network  with  relative  sensing  |sk.  • 

Example  3.16  (Range-limited  visibility  sensor  and  corresponding 
relative-sensing  network).  Given  a  sensing  range  r  G  M>o,  the  range- 
limited  visibility  sensor  has  sensor  footprint  B(0d,  r)  and  performs  measure¬ 
ments  only  of  objects  within  unobstructed  line  of  sight.  Regarding  sensing 
of  other  robots,  we  assume  that  the  alphabet  is  Arbt  =  K^Ujnull}  and  that 
the  sensing  function  is  rbt-sns  (p^ )  =  p\^  for  each  robot  j  G  {1, ...  ,n}\  {*}, 
inside  the  range- limited  visibility  set  Vidisk(02;  Qi )>  and  rbt-sns (p^)  =  null, 
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otherwise.  Regarding  sensing  of  the  environment,  we  assume1  that  the  al¬ 
phabet  is  Aenv  =  P(Md)  and  that  the  environment  sensor  measures  the  range- 
limited  visibility  set  Vicj ;sk (PflJ,ecj ;  Q )  expressed  with  respect  to  the  frame  E M ; 
see  Section  2.1.2  for  the  definition  of  range-limited  visibility  set.  In  other 
words,  the  environment  sensing  function  is  env-sns(Q,;)  =  Vidisk(02;  Qi)- 
This  is  illustrated  in  Figure  3.8.  A  group  of  robots  with  range-limited  visi- 


Figure  3.8  The  left  plot  depicts  the  range- limited  visibility  set  Vidisk(pji].ed;  Q)  expressed 
with  respect  to  an  inertially  fixed  frame.  The  right  plot  depicts  the  range- 
limited  visibility  set  expressed  with  respect  to  the  body-fixed  frame  E  M ,  that 

is,  VIdisk(02j 


bility  sensors  defines  the  robotic  network  with  relative  sensing  5^is-disk-  * 

Remark  3.17  (Sensing  model  consequences).  The  proposed  sensing 
model  has  the  following  two  consequences: 

(i)  Robots  have  no  information  about  the  absolute  position  and  orienta¬ 
tion  of  themselves,  the  other  robots  or  any  part  of  the  environment; 
and 

(ii)  the  relative  sensing  capacity  of  the  robots  gives  rise  to  a  proximity 

graph,  called  the  sensing  graph ,  whose  edges  are  the  collection  of 
robot  pairs  that  are  within  sensing  range.  For  example,  in  the  net¬ 
work  5j-sk,  the  sensing  graph  is  the  disk  graph  f?disk(r)-  In  general, 
sensing  graphs  are  directed.  • 


3.2.3  Relative-sensing  control  laws 

As  we  did  for  robotic  networks  with  interactions  based  on  communication, 
we  define  here  control  laws  based  on  relative  sensing  and  describe  the  closed- 
loop  evolution  of  robotic  networks  with  relative  sensing. 


Tt  would  be  equivalent  to  assume  that  the  robot  can  sense  every  portion  of  OQ  that  is  within 
distance  r  and  that  is  visible  from  the  robot  position. 
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First,  we  consider  a  robotic  network  with  relative  sensing  Svs  characterized 
by:  identifiers  {l,...,n},  configurations  in  Q  x  SO (d),  for  an  allowable 
environment  Q  C  Md,  dynamics  described  by  equation  (3.2.2),  and  relative 
sensors  described  by  the  sensor  footprint  S ,  sensing  alphabets  Arbt  and 
Aenv,  and  sensing  functions  rbt-sns  and  env-sns.  A  relative- sensing  control 
law  1ZSC  for  the  robotic  network  with  relative  sensing  Srs  consists  of  the 
tuple. 

(i)  W,  called  the  processor  state  set,  with  corresponding  set  of  allowable 
initial  values  Wo  C  W; 

(ii)  stf  :  W  x  A”bt  x  Aenv  — >  W,  called  the  (processor)  state-transition 
function ;  and 

(iii)  ctl  :  W  x  A”bt  x  Aenv  — >  U,  called  the  (motion)  control  function. 

As  for  robotic  networks,  we  say  that  7 ZSC  is  static  if  W  is  a  singleton  for 
all  i  G  {1, . . . ,  n};  in  this  case  1ZSC  can  be  described  by  a  motion  control 
function  ctl  :  A”bt  x  Aenv  — ►  U.  Additionally,  if  the  environment  Q  =  Md, 
then  7 ZSC  can  be  described  by  a  motion  control  function  ctl  :  W  x  A”bt  — >  U . 

Second,  the  evolution  of  (SVS,7ZSC)  from  initial  conditions  (Po^,-^flled)  ^ 
Rd  x  SO (d)  and  G  Wo,  i  G  {1, . . . ,  n},  is  the  collection  of  curves  pbJ<ed  : 

M>o  — >  and  uM  :  Z>o  — >  W,  i  G  {1, . . . ,  n},  defined  by 

4xed  (*)  =  ^fixed  Ctl  (w  W  (  L* J  ) .  V  W  (  L* J  ) ,  VeL  (  W  )  )  , 

w^(£)  =  Sti(w^(e-l),y^(£),y^vm 

with  p&Ld(°) =  Po’ and  =  ,  i  G  {l,...,n}.  In  the  previous 

equations,  yW  :  Z>o  — >  A"bt  (describing  the  robot  measurements  taken  by 
sensor  i)  with  components  for  j  G  {1 ,n},  and  yinv  :  Z>o  — >  Aenv 

(describing  the  environment  measurements  taken  by  sensor  i)  are  defined  by 

yf  (0  =  rbt-sns  (p^  {£)),  y[flv(£)  =  env-sns  (Qi(£)). 

In  the  last  equation,  and  Qi(£)  denote  the  position  of  the  j-th  robot  and 
the  environment  Q  as  expressed  with  respect  to  the  moving  frame  SW. 


3.2.4  Equivalence  between  control  and  communication  laws  and  relative¬ 
sensing  control  laws 


Consider  a  “communication-based”  robotic  network  with  control  and 
communication  law  CC\  with  the  following  properties: 
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(i)  Regarding  <Si:  the  network  is  uniform,  the  state  space  is  X  =  M.d 
with  states  denoted  by  x'M  =  pM,  the  communication  graph  is  the 
r-disk  graph,  and  the  robot  dynamics  is  jjM  =  y\l\ ;  and 

(ii)  regarding  CC\\  the  control  and  communication  law  is  uniform  and 
data-sampled,  the  communication  alphabet  is  A  =  MrfU{null},  and 
the  message-generation  function  is  msg (p,w,j)  =  p. 

Given  a  network  and  a  law  (<Si,CCi)  satisfying  (i)  and  (ii),  the  control 
and  communication  law  CC\  is  invariant  if  its  state  transition  and  control 
maps  satisfy,  for  all  p  £  Rd,  w  £  W,  y  £  An  and  R  £  SO (d), 

stf (p,  w,  y )  =  stf(Od,  w,  R(y  -  p)) , 
ctl(p,  w,  y)  =  RTctl(Od,  w,  R{y  -  p)) , 

where  the  ith  component  of  R(y  —  p)  £  An  is  R(yi  —  p)  if  yi  £  M.d,  or  null 
if  yi  =  null. 

Next,  consider  a  relative-sensing  network  S2  with  disk  sensors  as  in  Ex¬ 
ample  3.15,  that  is,  assume  the  sensing  footprint  is  B(0d,r),  the  sensing 
alphabet  is  Arbt  =  Ujnull}  and  the  sensing  function  equals  the  identity 
function  in  R(0^,r).  We  assume  no  environment  sensing  as  we  set  Q  =  Wl. 
The  communication  and  control  law  CC  \  and  the  relative-sensing  control  law 
RSC2  for  network  S2  are  equivalent  if  their  processor  state  sets  identical, 
e.g.,  denote  both  by  W,  and  their  state  transition  and  control  maps  satisfy, 
for  all  w  £  W  and  y  £  MdU{null}  =  An  =  A”bt, 

stfi(0<2,  w,  y)  =  stf 2(w,y),  and  ctli(Od,  w,  y)  =  ctl2(w,y). 

Proposition  3.18  (Evolution  equivalence).  If  CC\  is  invariant  and  if 
CC\  and1ZSC2  are  equivalent,  then  the  evolutions  of  (Si,CCi)  and  ( S2,RSC2 ) 
from  identical  initial  conditions  are  identical. 


Proof.  Assume  that  the  messages  and  measurements  array  y^(t)  received 
by  the  i-th  robot  at  time  t  in  the  communication-based  network  and  in  the 
relative-sensing  networks  are  equal  to,  respectively: 


pm 


’■Pfixld’ 


and 


Pi 


.Pi 


Then,  the  evolution  of  the  communication-based  network  and  of  the  relative¬ 
sensing  networks  are  written  as,  respectively 

pSL  =  ^(pLd.^.p&ii.  •  •  • 

pLd  =  Cd^^,??11, . .  ,p[?k]). 
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From  equation  (3.2.1)  we  know  that,  for  all  j  £  {j\, . . .  ,jk}, 

P[lled  =  4xed  pf  +  PfLed  =>  P f’  =  (^fLedf  (Pfiled  ~  Pfixed)' 

From  this  equality  and  from  the  fact  that  CCi  is  invariant,  we  observe  that 

Ctll  (^fixed ’  w{A  >  Pfixld>  •  '  '  -  Pfixid)  =  ^LdCtii  (0d,  W [i] ,  ppl] ,  .  .  •  ,  ) . 


Since  CC\  and  PISC2  are  equivalent,  the  two  evolution  equations  coincide. 
A  similar  reasoning  shows  that  also  the  evolutions  of  the  processor  states 
are  identical.  ■ 


Remark  3.19  (Communication-based  laws  on  relative-sensing  net¬ 
works).  Proposition  3.18  implies  the  following  fact.  Given  an  invariant 
control  and  communication  law  for  a  robotic  network  satisfying  all  appro¬ 
priate  properties,  the  control  and  communication  law  can  be  implemented 
on  an  appropriate  relative-sensing  network  as  a  relative-sensing  control  law. 


3.3  COORDINATION  TASKS  AND  COMPLEXITY  NOTIONS 

In  this  section  we  introduce  concepts  and  tools  useful  to  analyze  a  com¬ 
munication  and  control  law  in  a  robotic  network;  our  treatment  is  directly 
generalized  to  relative-sensing  networks.  We  address  the  following  questions: 
What  is  a  coordination  task  for  a  robotic  network?  When  does  a  control 
and  communication  law  achieve  a  task?  And  with  what  time,  space,  and 
communication  complexity? 


3.3.1  Coordination  tasks 

Our  first  analysis  step  is  to  characterize  the  correctness  properties  of  a  com¬ 
munication  and  control  law.  We  do  so  by  defining  the  notion  of  task  and  of 
task  achievement  by  a  robotic  network. 

Definition  3.20  (Coordination  task).  Let  S  be  a  robotic  network  and 
let  W  be  a  set. 

(i)  A  coordination  task  for  S  is  a  map  T :  X  M  xWn  — >  {true,  false}. 

(ii)  If  W  is  a  singleton,  then  the  coordination  task  is  said  to  be  static 
and  can  be  described  by  a  map  T :  riie/  X M  — >  {true,  false}. 

Additionally,  let  CC  a  control  and  communication  law  for  S. 

(i)  The  law  CC  is  compatible  with  the  task  T  :  \\ieIX^  x  Wn  — > 
{true,  false}  if  its  processor  state  take  values  in  W,  that  is,  if 
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W®  =  W,  for  all  i  €  I. 

(ii)  The  law  CC  achieves  the  task  T  if  it  is  compatible  with  it  and  if,  for 

w  u]  [ji  [ji 

all  initial  conditions  Xq  E  Vq  and  u>q  E  Wq  ,  i  E  /,  there  exists 
T  E  M>o  such  that  the  network  evolution  t  i— >  (x(t),w(t))  has  the 
property  that  T(x(t),  w(t))  =  true  for  all  t  >T.  • 

Remark  3.21  (Temporal  logic).  Loosely  speaking,  achieving  a  task  means 
obtaining  and  maintaining  a  specified  pattern  in  the  robot  physical  or  pro¬ 
cessor  state.  In  other  words,  the  task  is  achieved  if  at  some  time  and  for  all 
subsequent  times  the  predicate  evaluates  to  true  along  system  trajectories. 
It  is  possible  to  consider  more  general  tasks  based  on  more  expressive  pred¬ 
icates  on  trajectories.  Such  predicates  can  be  defined  through  various  forms 
of  temporal  and  propositional  logic,  e.g.,  see  [Emerson,  1994].  In  particular, 
(linear)  temporal  logic  contains  certain  constructs  that  allow  reasoning  in 
terms  of  time  and  is  hence  appropriate  for  robotic  applications,  as  argued 
for  example  by  Fainekos  et  al.  [2005].  Network  tasks  such  as  periodically 
visiting  a  desired  set  of  configurations  can  be  encoded  with  temporal  logic 
statements.  • 


Example  3.22  (Direction  agreement  and  equidistance  tasks).  From 
Example  3.7,  consider  the  uniform  network  5c;rcie  of  locally-connected  first- 
order  agents  in  S1.  From  Section  3.1.3,  recall  the  AGREE  &;  PURSUE  control 
and  communication  law  CCAgree  &  pursue  with  processor  state  taking  values 
in  W  =  {cc,c}  x  I.  There  are  two  tasks  of  interest.  First,  we  define  the 
direction  agreement  task  7dir  :  (S1)”  x  Wn  — >  {true,  false}  by 


%iT{6,w) 


true, 

false, 


if  dirW  = 
otherwise, 


dirW, 


where  9  =  (0M, . . . ,  0^),  w  =  (u;M, . . . ,  u;^),  and  =  (dirW, max-id^), 
for  i  E  /.  Furthermore,  for  e  >  0,  we  define  the  static  equidistance  task 
7{-eqdstnc  :  (S1)”  ~ ^  {true,  false}  to  be  true  if  and  only  if 


min distc (0^,0^)  —  mindistcc(0W, 0^)|  <  e,  for  all  i  E  I. 


In  other  words,  7{:_eqdstnc  is  true  when,  for  every  agent,  the  distance  to  the 
closest  clockwise  neighbor  and  to  the  closest  counterclockwise  neighbor  are 
approximately  equal.  • 


3.3.2  Complexity  notions 

We  are  finally  ready  to  define  the  key  notions  of  time,  space  and  communica¬ 
tion  complexity.  These  notions  describe  the  cost  that  a  certain  control  and 
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communication  law  incurs  while  completing  a  certain  coordination  task.  We 
also  define  the  complexity  of  a  task  to  be  the  infimum  of  the  costs  incurred 
by  all  laws  that  achieve  that  task.  We  begin  with  a  remark  highlighting  a  dif¬ 
ference  between  what  follows  and  the  complexity  treatment  for  synchronous 
networks. 

Remark  3.23  (Termination  via  task  completion).  As  discussed  in  Re¬ 
mark  1.41  in  Section  1.5,  it  is  possible  to  consider  various  algorithm  termi¬ 
nation  notions.  Here  we  will  establish  the  completion  of  an  algorithm  as  the 
instant  when  a  given  task  is  achieved.  • 

First,  we  define  the  time  complexity  of  an  achievable  task  as  the  minimum 
number  of  communication  rounds  needed  by  the  agents  to  achieve  the  task  T. 

Definition  3.24  (Time  complexity).  Let  S  be  a  robotic  network  and  let 
T  be  a  coordination  task  for  S.  Let  CC  be  a  control  and  communication  law 
for  S  compatible  with  T. 

(i)  The  ( worst-case )  time  complexity  to  achieve  T withCC  from  (xq,  w o)  £ 

nief41xnI^f  ^ 

TC(7)  CC,  xo,  wo)  =  inf  {£  |  T(x(k),w(k))  =  true  ,  for  all  k  >  £}, 

where  t  i— >  (x(t),w(t))  is  the  evolution  of  ( S,CC )  from  the  initial 
condition  (xq,wq)] 

(ii)  the  (worst-case)  time  complexity  to  achieve  T  with  CC  is 
TC(T,CC)  =sup  {tC(T,CC,x0,w0)  |  (x0, u>o)  <E  II Xo  x  11^}; 

iel  i&I 

(iii)  the  ( worst-case )  time  complexity  of  T is 

TC(7)  =  inf{TC(T,  CC)  \  CC  compatible  with  T}.  • 


Next,  we  quantify  memory  and  communication  requirements  of  commu¬ 
nication  and  control  laws.  We  assume  that  elements  of  the  processor  state 
set  W  or  of  the  alphabet  set  A  might  amount  to  multiple  “basic  memory 
units”  or  “basic  messages.”  We  let  |VF|basic  and  |A|t,asic  denote  the  number 
of  basic  memory  units  and  basic  messages  required  to  represent  elements 
of  W  and  A,  respectively.  The  null  message  has  zero  cost.  To  clarify  this 
assumption,  we  adopt  two  conventions.  First,  as  in  Section  1.5.2,  we  assume 
that  a  “basic  memory  unit”  or  a  “basic  message”  contains  log(n)  bits.  This 
implies  that  the  log(n)  bits  required  to  store  or  transmit  a  robot  identifier 
i  £  {1, . . .  ,n}  are  equivalent  to  one  “basic  memory  unit.”  Second,  as  men¬ 
tioned  in  Remark  3.11,  we  assume  that  a  processor  can  store  and  transmit 
a  (finite  number  of)  integer  and  real  numbers  and  we  adopt  the  convention 
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that  any  such  number  is  quantized  and  represented  by  a  constant  number 
of  basic  memory  units  or  basic  messages. 

We  now  quantify  memory  requirements  of  algorithms  and  tasks  by  count¬ 
ing  the  required  number  of  basic  memory  units.  Let  the  network  S,  the  task 
T  and  the  control  and  communication  law  CC  be  as  in  Definition  3.24. 

Definition  3.25  (Space  complexity).  (i)  The  ( worst-case )  space  com¬ 
plexity  to  achieve  T with  CC,  denoted  by  SC(T,  CC),  is  the  maximum 
number  of  basic  memory  units  required  by  a  robot  processor  exe¬ 
cuting  the  CC  on  S  among  all  robots  and  among  all  allowable  initial 
physical  and  processor  states  until  termination;  and 

(ii)  the  space  complexity  of  T  is  the  infimum  among  the  space  complex¬ 
ities  of  all  control  and  communication  laws  that  achieve  T.  • 

The  set  of  all  non-null  messages  generated  during  one  communication 
round  from  network  state  (x,  w )  is  denoted  by 

M(x,w)  =  {( i,j )  G  Ecmm(x)  |  msgW(xW,u;W,  j)  /  null}. 

We  now  quantify  mean  and  total  communication  requirements  of  algorithms 
and  tasks  by  counting  the  number  of  transmitted  basic  messages. 

Definition  3.26  (Mean  and  Total  Communication  complexity).  (i) 

The  (worst-case)  mean  communication  complexity  and  the  (worst- 
case)  total  communication  complexity  to  achieve  T  with  CC  from 

(^0)  wo)  £  Utei  4  ^  x  Re/  Wq*1  are,  respectively, 

MCC(T,CC,x0,w0)  =  V|M(x(4H0)|, 

T  z ' 

e=o 

T—  1 

TCC(T,CC,x0,w0)  =  | A | basic  ^  ,w(l))\, 

e=o 

where  t  t— >  (x(t),w(t))  is  the  evolution  of  (S,CC)  from  the  initial 
condition  (xo,ico)  an<^  where  r  =  TC{CC ,T,xq,wq).  Here,  MCC  is 
defined  only  for  initial  conditions  (xq,wq)  with  the  property  that 
T{xq,wq)  =  false; 

(ii)  the  (worst-case)  mean  communication  complexity  (resp.  the  (worst- 
case)  total  communication  complexity )  to  achieve  T  with  CC  is  the 
supremum  of  MCC(7}  CC,  xo,  wq)  (resp.  TCC(T,  CC,  xq,  wq))  over 
all  allowable  initial  states  (xo,tco);  and 

(iii)  the  (worst-case)  mean  communication  complexity  (resp.  the  (worst- 
case)  total  communication  complexity )  of  T  is  the  infimum  among 
the  mean  communication  complexity  (resp.  the  total  communica- 
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tion  complexity)  of  all  control  and  communication  laws  that  achieve 

r.  • 

By  construction,  one  can  verify  that  it  always  happens  that 

TCC(T,CC )  <  MCC (T,CC)  •  TC (T,CC).  (3.3.1) 

We  conclude  this  section  with  possible  variations  and  extensions  of  the  com¬ 
plexity  definitions. 

Remark  3.27  (Infinite- horizon  mean  communication  complexity). 

The  mean  communication  complexity  MCC  measures  the  average  cost  of  the 
communication  rounds  required  to  achieve  a  task  over  a  finite  time  horizon; 
a  similar  statement  holds  for  the  total  communication  complexity  TCC.  One 
might  be  interested  in  a  notion  of  mean  communication  complexity  required 
to  maintain  true  the  task  for  all  times.  Accordingly,  the  infinite-horizon 
mean  communication  complexity  of  CC  from  initial  conditions  (xq,wq)  is 

IH-MCC(CC,  x0,  u>0)  =  lim  ^basic  V  \M{x(£),  w(£))\.  • 

T— >+oc  T  ^ ' 

£=0 

Remark  3.28  (Communication  complexity  in  omnidirectional  net¬ 
works).  In  omnidirectional  wireless  networks  the  standard  operation  mode 
is  for  all  neighbors  of  a  node  to  receive  the  signal  it  transmits.  In  other 
words,  the  transmission  is  omnidirectional  rather  than  unidirectional.  It 
is  straightforward  to  require  the  message  generation  function  to  have  the 
property  that  the  output  it  generates  be  independent  of  the  intended  re¬ 
ceiver.  Under  such  assumptions,  it  make  sense  to  count  as  communication 
complexity  not  the  number  of  messages  transmitted  in  the  network,  but  the 
number  of  transmissions,  i.e. ,  a  unit  cost  per  node  rather  than  a  unit  cost 
per  edge  of  the  network.  • 

Remark  3.29  (Energy  complexity).  Given  a  model  for  the  energy  con¬ 
sumed  by  the  robot  to  move  and  to  transmit  a  message,  one  can  easily 
define  a  notion  of  energy  complexity  for  a  control  and  communication  law. 
In  modern  wireless  transmitters,  the  energy  consumptions  in  transmitting 
a  signal  at  a  distance  r  varies  with  a  power  of  r.  Analogously,  energy  con¬ 
sumption  is  an  increasing  function  of  distance  traveled.  We  consider  this  to 
be  a  promising  avenue  for  further  research.  • 


3.3.3  Invariance  under  rescheduling 

Here  we  discuss  the  invariance  properties  of  time  and  communication  com¬ 
plexity  under  the  rescheduling  of  a  control  and  communication  law.  The 
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idea  behind  rescheduling  is  to  “spread”  the  execution  of  the  law  over  time 
without  affecting  the  trajectories  described  by  the  robots. 

For  simplicity  we  consider  the  setting  of  static  laws;  similar  results  can 
be  obtained  for  the  general  setting.  Also,  for  ease  of  presentation,  we  allow 
our  communication  and  control  laws  to  be  time  dependent,  i.e.,  we  con¬ 
sider  message-generation  functions  and  motion  control  functions  of  the  form 
msgW  :  Z>0  x  x/-»  A  and  ctl^  :  M>0  x  X®  x  X®  xA"-*  U®,  respec¬ 
tively.  The  Definition  3.10  of  network  evolution  can  be  readily  extended  to 
this  more  general  time-dependent  setup. 

Let  S  =  (I ,F,  Ecmm)  be  a  robotic  network  where  each  mobile  robot  is  a 
driftless  control  system,  as  defined  in  Section  1.3.  Let  CC  =  (A,  {msgW}jS/,  {ctl^}je/) 
be  a  static  control  and  communication  law.  In  what  follows,  we  define  a 
new  control  and  communication  law  by  modifying  CC ;  to  do  so  we  intro¬ 
duce  some  notation.  Let  s£N,  with  s  <  n,  and  let  Vj  =  {Iq,  . . . ,  7,s_i}  be 
an  s-partition  of  /,  that  is,  Io,...,Is-i  C  I  are  disjoint  and  nonempty 
and  /  =  UjAg/fc.  For  i  E  I,  define  the  message-generation  functions 
msg^  :  Z>o  x  I'M  x  /  — >  A  by 

msg^7(^,  x,j)  =  msgW(L^/sJ  ,x,j),  (3.3.2) 

if  i  E  Ik  and  k  =  t  mod  s,  and  msg ^  (£,x,j)  =  null  otherwise.  According 
to  this  message-generation  function,  only  the  agents  with  unique  identifier 
in  Ik  will  send  messages  at  time  £,  where  £  S  {k  +  as}a£z>0.  Equivalently, 
this  can  be  stated  as  follows:  according  to  (3.3.2),  the  messages  originally 
sent  at  the  time  instant  £  are  now  rescheduled  to  be  sent  at  the  time  instants 
F{£)  —  s  +  1, . . . ,  F{£),  where  F  :  Z>o  — >  Z>o  is  defined  by  F(£)  =  s(£+l)  —  l. 

Figure  3.9  illustrates  this  idea.  For  is/,  define  the  control  functions 


£  £+1 


H 


Figure  3.9  Under  the  rescheduling,  the  messages  that  are  sent  at  the  time  instant  £  under 
the  control  and  communication  law  CC  are  rescheduled  to  be  sent  over  the 
time  instants  F(£)  —  s  +  1, . . . ,  F(£)  under  the  control  and  communication  law 
cC(„,p7). 
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ctl[i]  :  M>0  x  XM  x  xM  xAn^  U®  by 

ctlpr  (t,  x,  xsmpid,  y)  =ctl[il  (t  -  t  +  -F_1(i),x,xsmpid,2/)  ,  (3.3.3) 

if  7  €  [£,£  +  1]  and  i  =  —  1  mod  s  and  ctlp  (t,  x,  ®smpid,  y)  =  0  otherwise. 
Here  F~1  :  Z>o  — >  Z>o  is  the  inverse  of  F,  defined  by  F~1(l)  =  —  1. 

Roughly  speaking,  the  control  law  ctlp;  makes  the  agent  i  wait  for  the  time 
intervals  with  l  G  {as  —  l}aeN)  to  execute  any  motion.  Accordingly, 

the  evolution  of  the  robotic  network  under  the  original  law  CC  during  the 
time  interval  \£,  £  +  1]  now  takes  place  when  all  the  corresponding  messages 
have  been  transmitted,  i.e. ,  along  the  time  interval  [F(£),  F(£)  +  1],  The 
following  definition  summarizes  this  construction. 

Definition  3.30  (Rescheduling  of  control  and  communication  laws). 

Let  S  =  (1, 77,  BCmm)  be  a  robotic  network  with  driftless  physical  agents, 
and  let  CC  =  (Z>o,  A,  {msgW},e/,  {ctl^jie/)  be  a  static  control  and  commu¬ 
nication  law.  Let  s  G  N,  with  s  <  n,  and  let  Vi  be  an  s-partition  of  /.  The 
control  and  communication  law  CC^S  Fi)  =  (Z>0,A,  {msg^}ie/,{ctl^}i6/) 
defined  by  equations  (3.3.2)  and  (3.3.3)  is  called  a  V / -rescheduling  of  CC.  • 

The  following  result  whose  proof  is  presented  in  Section  3.6.1  shows  that 
the  total  communication  complexity  of  CC  remains  invariant  under  reschedul¬ 
ing. 

Proposition  3.31  (Complexity  of  rescheduled  laws).  With  the  as¬ 
sumptions  of  Definition  3.30,  let  T :  Re/  XM  — >  {true,  false}  be  a  coordi¬ 
nation  task  for  S.  Then,  for  all  xq  G  IX;e/ 

TC(T, x0)  =  s  •  TC(T,CC,x0) . 

Moreover,  if  Crnd  is  additive,  then,  for  all  xq  E 

MCC  (FCC(StVl),x0)  =  l--MCC(TXC,x0), 

and  therefore,  TCC(7}  CC(Sj-p7),  xq)  =  TCC(T)  CC,  xq),  i.e.,  the  total  com¬ 
munication  complexity  of  CC  is  invariant  under  rescheduling. 

Remark  3.32  (Appropriate  complexity  notions  for  driftless  agents). 

Given  the  results  in  the  previous  theorem,  one  should  be  careful  in  choosing 
what  notion  of  communication  complexity  to  evaluate  control  and  commu¬ 
nication  laws.  For  driftless  physical  agents,  rather  than  the  mean  communi¬ 
cation  complexity  MCC,  one  should  really  consider  the  total  communication 
complexity  TCC,  since  the  latter  is  invariant  with  respect  to  rescheduling. 
Note  that  the  notion  of  infinite-horizon  mean  communication  complexity 
IH-MCC  defined  in  Remark  3.27  satisfies  the  same  relationship  as  MCC, 
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that  is,  IH-MCC(CC(ai^),x0)  =  \  IH-MCC(CC,  ®0)-  • 

3.4  COMPLEXITY  OF  DIRECTION  AGREEMENT  AND 
EQUIDISTANCE 

From  Example  3.7,  Section  3.1.3,  and  Example  3.22,  recall  the  definition 
of  uniform  network  5circie  of  locally-connected  first-order  agents  in  S1,  the 
agree  &  PURSUE  control  and  communication  law  CCAGrEe  &  pursue,  and  the 
two  coordination  tasks  Ta±t  and  77_eqdstnc-  In  this  section,  we  characterize 
the  complexity  to  achieve  these  coordination  tasks  with  CCAGREE  &  pursue- 
Because  the  number  of  bits  required  to  represent  the  variable  max-id  E 
{1, . . .  ,n}  is  log(n),  Note  that  the  space  complexity  of  CCAGREE  &  pursue  is 
log(n)  bits,  that  is  one  basic  memory  unit  in  our  convention  discussed  in 
Section  3.3.2. 

Motivated  by  Remark  3.8,  we  model  wireless  communication  congestion 
by  assuming  that  the  communication  range  is  a  monotone  non-increasing 
function  r  :  N  — >  ]0,  n[  of  the  number  of  agents  n.  Likewise,  we  assume  that 
the  maximum  control  amplitude  umax  is  a  non-increasing  function  umax  : 

N  — >  ] 0, 1  [;  recall  that  umax  is  the  maximum  robot  speed.  Finally,  it  is 
convenient  to  define  the  function  n  i— >  5(n)  =  nr(n )  —  2n  E  R  that  compares 
the  sum  of  the  communication  ranges  of  all  the  robots  with  the  length  of 
the  unit  circle. 

We  are  now  ready  to  state  the  main  result  of  this  section;  proofs  are 
postponed  to  Section  3.6.2. 

Theorem  3.33  (Time  complexity  of  agree-and-pursue  law).  For  kpmp  E 
] 0,  ?;[,  in  the  limit  as  n  — >  +oo  and  £  — >  0+,  the  network  5circie  with 
Rmax(^)  '  kpropr(n),  the  law FFAGREE  ^  pursue,  and  the  tasks  Tiir  and  T-eqdstnc 
together  satisfy  the  following  properties: 

(i)  TC(7dir, CCagree  &  pursue)  £  &(r(n)  1);  and 
(ii)  if  5 (n)  is  lower  bounded  by  a  positive  constant  as  n  —r  +oo,  then 

TC(7e-eqdstnc,  CC agree  &  pursue)  £  log(ne)  1), 

TC  (Te 

-eqdstnc,  CCAGREE  &  pursue)  e  0(n2  log (ne  1)). 

IfS(n)  is  upper  bounded  by  a  negative  constant,  thenCCAGREB  &  pursue 
does  not  achieve  7^_eqdstnc  'm  general. 

Next,  we  study  the  total  communication  complexity  of  the  agree-and- 
pursue  control  and  communication  law.  First,  we  note  that  any  message  in 
A  =  SX  x  {cc,  c}  x  {1, . . . ,  n}  Ujnull}  requires  only  a  finite  number  of  basic 
message  to  encode,  that  is,  | A | basic  G  0(1). 
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Theorem  3.34  (Total  communication  complexity  of  agree-and-pur- 
sue  law).  For  kpmp  £  ]0,  |[;  in  the  limit  as  n  — ►  +oo  and  e  — >  0+,  the 
network  cScircie  with  umSLX(n)  >  kpiopr(n),  the  law  CCAG REE  &  pursue,  and  the 
tasks  7^ir  and  7^-eqdstnc  together  satisfy  the  following  properties: 

(i)  If  S(n)  >  7r(l/fcprop  —  2)  as  n  — >  +oo,  then 

TCC(7dirj  CC AGREE  &  pursue)  €  ©(nV(n)-1), 
otherwise  if  5(n)  <  7r(l/fcprop  —  2)  as  n  — >  +oo,  f/ien 

TCC(7dir,  CC  agree  &  pursue)  G  +  nr(n)  1), 

TCC(/?dir ,  CCAGREE  pursue)  S  0(n2r(n)_1); 

(k)  if  5(n)  is  lower  bounded  by  a  positive  constant  as  n  —>  +oo,  i/ien 
TCC(7^_eqCistnc,  CCAGREE  ^  pursue)  ^  fl(n3<i(n)  log(ne)  ), 
TCC(/7^_eqCistnc;  CCAGREE  &  pursue)^  0(n4log(ne  3)). 

Remark  3.35  (Comparison  with  leader  election).  Let  us  compare 
the  agree-and-pursue  control  and  communication  law  with  the  classical  Le 
Lann-Chang-Roberts  (LCR)  algorithm  for  leader  election  discussed  in  Sec¬ 
tion  1.5.4.  The  leader  election  task  consists  of  electing  a  unique  agent  among 
all  agents  in  the  network,  therefore,  it  is  different  from,  but  closely  related 
to,  the  coordination  task  7dir-  The  LCR  algorithm  operates  on  a  static 
network  with  the  ring  communication  topology,  and  achieves  leader  elec¬ 
tion  with  time  and  total  communication  complexity,  respectively,  0(n)  and 
0(n2).  The  agree-and-pursue  law  operates  on  a  robotic  network  with  the 
r(n)-disk  communication  topology,  and  achieves  Tm  with  time  and  total 
communication  complexity,  respectively,  ©(r(n)^1)  and  0(n2r(n)_1).  If 
wireless  communication  congestion  is  modeled  by  r{n)  of  order  1/n  as  in 
Remark  3.8,  then  the  two  algorithms  have  identical  time  complexity  and 
the  LCR  algorithm  has  better  communication  complexity.  Note  that  com¬ 
putations  on  a  possibly  disconnected,  dynamic  network  are  more  complex 
than  on  a  static  ring  topology.  • 

3.5  NOTES 

The  literature  on  multi-robot  systems  is  very  extensive.  Examples  include 
the  survey  by  Cao  et  al.  [1997],  the  text  by  Arkin  [1998]  on  behavior-based 
robotics,  and  the  recent  special  issues  [Arai  et  al.,  2002,  Abdallah  and  Tanner, 
2007].  Together  with  this  literature,  the  starting  point  to  develop  the  ma¬ 
terial  of  this  chapter  are  the  standard  notions  of  synchronous  and  asyn¬ 
chronous  networks  in  distributed  [Lynch,  1997,  Peleg,  2000,  Tel,  2001]  and 
parallel  [Bertsekas  and  Tsitsiklis,  1997,  Parhami,  1999]  computation.  This 
established  body  of  knowledge  on  networks  is,  however,  not  applicable  to 
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the  robotic  network  setting  because  of  the  agents’  mobility  and  the  ensuing 
dynamic  communication  topology. 

An  early  influential  contribution  towards  a  network  model  of  mobile  in¬ 
teracting  robots  is  the  work  by  Suzuki  and  Yamashita  [1999].  This  model 
consists  of  a  group  of  identical  “distributed  anonymous  mobile  robots”  char¬ 
acterized  as  follows:  no  explicit  communication  takes  place  between  them, 
and  at  each  time  instant  of  an  “activation  schedule,”  each  robot  senses  the 
relative  position  of  all  other  robots  and  moves  according  to  a  pre-specified 
algorithm.  An  artificial  intelligence  approach  to  multi-agent  behavior  in 
a  shared  environment  is  taken  in  [Moses  and  Tennenholtz,  1995].  Santoro 
[2001]  provides,  with  an  emphasis  on  computer  science  aspects,  a  brief  sur¬ 
vey  of  models,  algorithms,  and  the  need  for  appropriate  complexity  notions. 
Recently,  a  notion  of  communication  complexity  for  control  and  commu¬ 
nication  algorithms  in  multi-robot  systems  is  analyzed  by  Klavins  [2003], 
see  also  [Klavins  and  Murray,  2004].  Notions  of  failures  and  robustness  in 
robotic  networks  are  discussed  by  Gupta  et  al.  [2006b].  From  a  broad  hybrid 
networked  systems  viewpoint,  our  robotic  network  model  can  be  regarded  as 
special  cases  of  the  general  modeling  paradigms  discussed  in  [Lynch  et  al., 
2003,  Lygeros  et  al.,  2003,  Sanfelice  et  ah,  2007]. 

A  key  feature  of  the  synchronous  robotic  network  model  proposed  in  this 
chapter  is  the  adoption  of  proximity  graphs  from  computational  geometry 
as  a  basis  for  our  communication  model.  This  design  choice  is  justified  by 
the  vast  wireless  networking  literature,  where  this  assumption  is  made.  The 
simplest  communication  model,  in  which  two  robots  communicate  only  if 
they  are  within  a  fixed  communication  range,  is  a  common  model  adopted 
studied,  for  example,  in  the  works  [Gupta  and  Kumar,  2000,  Santi,  2005, 
Lloyd  et  ah,  2005,  Li,  2003].  These  works  study  the  proximity  graph  so¬ 
lutions  to  various  communication  optimization  problems;  this  discipline  is 
referred  to  as  topology  control ,  cf.  Remark  3.8.  Although  we  focus  our 
presentation  on  the  topological  aspect  of  the  communication  service,  more 
realistic  communication  models  would  include  randomness,  packet  losses, 
coding,  quantization,  and  delays,  e.g.,  see  [Toh,  2001,  Tse  and  Viswanath, 
2005], 

Next,  we  review  some  literature  on  emergent  and  self-organized  swarming 
behaviors  in  biological  groups.  Interesting  dynamical  systems  arise  in  bio¬ 
logical  networks  at  multiple  levels  of  resolution,  all  the  way  from  interactions 
among  molecules  and  cells  [Miller  and  Bassler,  2001]  to  the  behavioral  ecol¬ 
ogy  of  animal  groups  [Okubo,  1986].  Flocks  of  birds  and  schools  of  fish  can 
travel  in  formation  and  act  as  one  unit  (see  [Parrish  et  ah,  2002]),  allowing 
these  animals  to  defend  themselves  against  predators  and  protect  their  ter¬ 
ritories.  Wildebeest  and  other  animals  exhibit  complex  collective  behaviors 
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when  migrating,  such  as  obstacle  avoiding,  leader  election,  and  formation 
keeping  (see  [Sinclair,  1977,  Gueron  and  Levin,  1993]).  Certain  foraging  be¬ 
haviors  include  individual  animals  partitioning  their  environment  into  non¬ 
overlapping  zones  (see  [Barlow,  1974]).  Honey  bees  [Seeley  and  Buhrman, 

1999],  gorillas  [Stewart  and  Harcourt,  1994],  and  whitefaced  capuchins  [Boinski  and  Campbell, 
1995]  exhibit  synchronized  group  activities  such  as  initiation  of  motion 
and  change  of  travel  direction.  These  remarkable  dynamic  capabilities  are 
achieved  apparently  without  following  a  group  leader;  see  [Okubo,  1986, 

Parrish  et  ah,  2002,  Gueron  and  Levin,  1993,  Barlow,  1974,  Seeley  and  Buhrman, 

1999,  Stewart  and  Harcourt,  1994,  Boinski  and  Campbell,  1995]  for  specific 
examples  of  animal  species  and  [Couzin  et  al.,  2005,  Conradt  and  Roper, 

2003]  for  general  studies.  A  comprehensive  exposition  of  bio-inspired  opti¬ 
mization  and  control  methods  is  presented  in  [Passino,  2004]. 

With  regards  to  distributed  motion  coordination  algorithms,  much  progress 
has  been  made  on  collective  pattern  formation  [Suzuki  and  Yamashita,  1999, 

Belta  and  Kumar,  2004,  Justh  and  Krishnaprasad,  2004,  Yang  et  al.,  2008, 

Sepulchre  et  al.,  2007],  flocking  [Tanner  et  al.,  2007,  Olfati-Saber,  2006,  Lee  and  Spong, 

2007,  Moshtagh  and  Jadbabaie,  2007],  motion  feasibility  of  formations  [Tabuada  et  al., 

2005] ,  formation  control  using  rigidity  and  persistence  theory  [Olfati-Saber  and  Murray, 

2002,  Baillieul  and  Suri,  2003,  Krick,  2007,  Yu  et  al.,  2008a,  Hendrickx  et  al., 

2007] ,  formation  stability  [Tanner  et  al.,  2004,  Lafferriere  et  al.,  2005,  Kang  et  al., 

2006,  Dunbar  and  Murray,  2006,  Smith  and  Hadaegh,  2007,  Zheng  et  al., 

2008] ,  motion  camouflage  [Justh  and  Krishnaprasad,  2006],  self-assembly  [Klavins  et  al., 

2006] ,  swarm  aggregation  [Gazi  and  Passino,  2003],  gradient  climbing  [Ogren  et  al., 

2004,  Cortes,  2007],  cyclic  pursuit  [Bruckstein  et  al.,  1991,  Marshall  et  al., 

2004,  Martinez  and  Bullo,  2006,  Smith  et  al.,  2005,  Pavone  and  Frazzoli, 

2007] ,  vehicle  routing  [Sharrna  et  al.,  2007],  motion  planning  with  collision 
avoidance  [Lumelsky  and  Harinarayan,  1997,  Pallottino  et  al.,  2007,  Hu  et  al., 

2007],  and  cooperative  boundary  estimation  [Bertozzi  et  al.,  2004,  Clark  and  Fierro, 

2007,  Casbeer  et  al.,  2006,  Zhang  and  Leonard,  2005,  Susca  et  al.,  2008].  It 
is  worth  also  mentioning  works  on  network  localization,  estimation,  and 
tracking,  e.g.,  see  [Barooah  and  Hespanha,  2007,  Aspnes  et  al.,  2006,  Oh  et  al., 

2007]  and  the  references  therein. 

Much  research  has  been  devoted  to  distributed  task  allocation  problems. 

The  work  in  [Gerkey  and  Mataric,  2004]  proposes  a  taxonomy  of  task  alloca¬ 
tion  problems.  In  papers  such  as  [Godwin  et  al.,  2006,  Alighanbari  and  How, 

2006,  Schumacher  et  al.,  2003,  Moore  and  Passino,  2007,  Tang  and  Ozgiincr, 

2005],  advanced  heuristic  methods  are  developed,  and  their  effectiveness  is 
demonstrated  through  analysis,  simulation  or  real  world  implementation. 

Distributed  auction  algorithms  are  discussed  in  [Castanon  and  Wu,  2003, 

Moore  and  Passino,  2007]  building  on  the  classic  works  in  [Bertsekas  and  Castanon, 

1991,  1993].  A  distributed  mixed-integer-linear-programming  solver  is  pro- 
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posed  in  [Alighanbari  and  How,  2006].  A  spatially  distributed  receding- 
horizon  scheme  is  proposed  in  [Frazzoli  and  Bulk),  2004,  Pavone  et  al.,  2007]. 

There  has  also  been  prior  work  on  target  assignment  problems  [Beard  et  al., 

2002,  Arslan  et  al.,  2007,  Zavlanos  and  Pappas,  2007a,  Smith  and  Bullo,  2007]. 

Target  allocation  for  vehicles  with  nonholonomic  constraints  is  studied  in  [Rathinam  et  al., 
2007,  Savla  et  al.,  2008,  2007a], 

3.6  PROOFS 

This  section  gathers  the  proofs  of  the  main  results  presented  in  the  chapter. 


3.6.1  Proof  of  Proposition  3.31 

Proof.  Let  t  i— >  x(t)  and  t  i— >  x{t)  denote  the  network  evolutions  starting 
from  xq  E  ILe/  Aq  1  under  CC  and  CC  (s  •p7),  respectively.  From  the  definition 
of  rescheduling,  one  can  verify  that,  for  all  k  E  Z>o, 

5l'1(f)  =  p1(ii,(fc_1)  +  1)’  fortEU2?0k1_i)+i[^^+1])  (361) 

\x®(t  -  F(k)  +  k),  for  t€  [F(k),F(k)  +  l\. 

By  definition  of  TC (T,  CC,x o),  we  have  T(x(k))  =  true,  for  all  k  >  TC (T,  CC,x o), 
and  T(x(TC(T]  CC,  xq)  —  1))  =  false.  Let  us  rewrite  these  equalities  in 
terms  of  the  trajectories  of  CC^p^.  From  (3.6.1),  one  can  write  x^(k)  = 
x^(F(k)),  for  all  i  E  I  and  k  E  Z>o-  Therefore,  we  have 

T(x(F(k)))  =  T(x{k))  =  true  ,  for  all  F(k)  >  F(TC{T,CC,x0)), 
T(x(F(TC(T,CC,x0)  -  1)))  =  T(x(TC(T,CC,x0)  -  1))  =  false, 

where  we  have  used  the  rescheduled  message-generation  function  in  (3.3.2). 

Now,  note  that  by  equation  (3.6.1),  x\l\l)  =  x\l\F(\l/ s\  —  1)  +  1),  for 
all  i  E  Z>o  and  all  j  E  /.  Therefore,  T(x(F(TC(T,  CC,  xo)  —  1)  +  1))  = 
T(x(F(TC(7j  CC,  xq))))  and  we  can  rewrite  the  previous  identities  as 

T(x(k))  =  true,  for  all  k  >  F(TC(T,  CC,  xo)  —  1)  +  1, 
T(x(F(TC(T,CC,x0)  -  1)))  =  false, 

which  imply  that  TC(T,  CC xo)  =  F(TC(T,  CC,  xq)  —  1)+1  =  s  TC(T,  CC,  xo). 
As  for  the  mean  communication  complexity,  additivity  of  Crnd  implies 

Crndo  M(£,x(£)) 

=  Crnd  o M(F(£)  -  s  +  1  ,x(F(£)  -  s  + 1))  +  ■  •  •  +  Crnd  o M(F(£),x(F(£))), 
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where  we  have  used  F(£  —  1)  +  1  =  F(£)  —  s  +  1.  We  conclude  the  proof  by 
computing 


TC(T,CC  (s^j)  5^0 )  1 

E  crndo 

£=0 

F(TC(T,CC  ,Xo)—l) 

E  Crndo  M(£,x(£)) 

£=0 

TC(T,CC,;r0)-l  F{1) 

=  E  E  Crnd°  M{k,x{k)) 

£=0  k=F(£)-s+l 

TC{T,CC,x0)-l 

=  E  CrndoAf(£,x(£)). 

£=0 


3.6.2  Proof  of  Theorem  3.33 

Proof.  In  the  following  four  STEPS  we  prove  the  two  upper  bounds  and  the 
two  lower  bounds: 

STEP  1:  We  start  by  proving  the  upper  bound  in  statement  (i).  We  claim 
that  TC(7^ir,CCAGREE  &  pursue)  <  27r/(fcpropr(n)),  and  we  reason  by  contra¬ 
diction,  i.e. ,  we  assume  that  there  exists  an  initial  condition  which  gives  rise 
to  an  execution  with  time  complexity  strictly  larger  than  2ir / (kpropr(n)). 
Without  loss  of  generality,  assume  dir^(O)  =  c.  For  £  <  2ir / (kpiopr(n)), 
define 

k{£)  =  argmin{distcc(6,^(0),  6^(£))  \  dir W(£)  =  cc,  i  6  {1, . . .  ,n}}. 

In  other  words,  agent  k(£)  is  the  agent  moving  counterclockwise  that  has 
smallest  counterclockwise  distance  from  the  initial  position  of  agent  n.  Note 
that  k(£)  is  well-defined  since,  by  hypothesis  of  contradiction,  7dir  is  false 
for  £  <  2f / (A:propr(n)).  According  to  the  state-transition  function  of  CCAgree  &  pursue 
(cf.  Section  3.1.3),  messages  with  dir  =  cc  can  only  travel  counterclock¬ 
wise,  while  messages  with  dir  =  c  can  only  travel  clockwise.  Therefore, 
the  position  of  agent  k(£)  at  time  £  can  only  belong  to  the  counterclockwise 
interval  from  the  position  of  agent  k( 0)  at  time  0  to  the  position  of  agent  n 
at  time  0. 

Let  us  examine  how  fast  the  message  from  agent  n  travels  clockwise.  To 
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this  end,  for  £  <  2tt  /  (kpmpr(ri)),  define 

j{£)  =  argmax{distc(0^(O), 6^(£))  J  max-idW(T)  =  n,  i  G  {1, . . .  ,n}}. 

In  other  words,  agent  j{£)  has  max-id  equal  to  n,  is  moving  clockwise,  and 
is  the  agent  furthest  from  the  initial  position  of  agent  n  in  the  clockwise 
direction  with  these  two  properties.  Initially,  j(0)  =  n.  Additionally,  for 
£  <  2-k  /  {kpropr(n)),  we  claim  that 

dist c(6^^\£),6^^+l^(£  +  1))  >  kpiopr(n). 

This  happens  because  either  (1)  there  is  no  agent  clockwise-ahead  of  6^^(£) 
within  clockwise  distance  r(n),  and  therefore,  the  claim  is  obvious,  or  (2) 
there  are  such  agents.  In  case  (2),  let  m  denote  the  agent  whose  clockwise 
distance  to  agent  j(£)  is  maximal  within  the  set  of  agents  with  clockwise 
distance  r(n)  from  }(£).  Then, 

distc(e^(£),6^e+1^(£+l)) 

=  dist  c(6>W£%),0H(f?+l)) 

=  distc(0^(4  9^{£))  +  distc(0H(£)^M(£  +  i)) 

>  dist c(0^(£),eW(£))  +  feprop(r(n)  -  distc^Wl^^H^)) 

=  ^’Propr(n)  +  (1  -  fcprop)  distc(0[jW1(f),  e^m\£))  >  kpropr(n), 

where  the  first  inequality  follows  from  the  fact  that  at  time  £  there  can 
be  no  agent  whose  clockwise  distance  to  agent  m  is  less  than  (r(n)  — 
distc(0^)l(T),  #[ml(T))).  Therefore,  after  2n/(kpiopr(  n ))  communication  rounds, 
the  message  with  max-id  =  n  has  traveled  the  whole  circle  in  the  clockwise 
direction,  and  must  therefore  have  reached  agent  k(£).  This  is  a  contradic¬ 
tion. 

STEP  2:  We  now  prove  the  lower  bound  in  statement  (i).  If  r(n)  >  n  for 
all  n,  then  l/r(n)  <  1/ir,  and  the  upper  bound  reads  TC(7^ir,CCAGREE  &  pursue)  G 
0(1).  Obviously,  the  time  complexity  of  any  evolution  with  an  initial  con¬ 
figuration  where  dirW(O)  =  cc  for  i  G  {1, . . . ,  n  —  1},  dir^O)  =  c  and 
^diakW(0[11(o),...,0M(o))  is  the  complete  graph,  is  lower  bounded  by  1. 
Therefore,  TC(7^ir,  CCAGREE  &  PURSUE)  G  12(1).  If  r(n)  >  tt  for  all  n,  then 
we  conclude  TC(7^ir,  CCAGREE  &  PURSUE)  G  ©(r(n)^1).  Assume  now  that 
r{n)  <  n  for  sufficiently  large  n.  Consider  an  initial  configuration  where 
dirW(O)  =  cc  for  i  G  {l,...,n  —  1},  dirlnl(0)  =  c,  and  the  agents  are 
placed  as  depicted  in  Figure  3.10.  Note  that,  after  each  communication 
round,  agent  1  has  moved  kpropr{n)  in  the  counterclockwise  direction,  while 
agent  n  has  moved  kpmpr(n)  in  the  clockwise  direction.  These  two  agents 
keep  moving  at  full  speed  towards  each  other  until  they  become  neighbors 
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Figure  3.10  Initial  condition  for  the  lower  bound  of  TC(7^ir,  CCAGREE  &  PUHSue),  with  0  < 
distc(0["-1](O),6l[rll(O))-r(n)  <e  and  distc (6>[1] (0) , (9[rl-1] (0))  <r(n)-e,  for 
some  fixed  e  >  0. 

at  a  time  lower  bounded  by 

27T  —  r(n )  7 r 

- —  > - 1. 

2^prop^”(tl)  ^prop^(tl) 

We  conclude  TC(Tdir,  CCAGree  &  pursue)  G  ^(r(n)^1). 

STEP  3:  We  now  prove  the  upper  bound  in  (ii).  We  begin  by  noting  that 
the  lower  bound  on  6  implies  r(n)-1  £  0(n).  Therefore,  TC(7^lr,  CCAGREE  &  pursue 
belongs  to  0(n)  and  is  negligible  as  compared  with  the  claimed  upper  bound 
estimates  for  TC(7^_eqdstnc)  CC  agree  &  pursue)-  In  what  follows,  we  therefore 
assume  that  Ta±t  has  been  achieved  and  that,  without  loss  of  generality,  all 
agents  are  moving  clockwise.  We  now  prove  a  fact  regarding  connectivity. 

At  time  £  £  Z>o,  let  H(l)  be  the  union  of  all  the  empty  “circular  segments” 
of  length  at  least  r(n),  that  is,  let 

H(£)  =  {x&E>1  |  min  distc(x,  Q^\£))  +  min  distcc(x,  0^1  (^))  >  r(n)}. 

je{l,...,n} 

In  other  words,  H(£)  does  not  contain  any  point  between  two  agents  sepa¬ 
rated  by  a  distance  less  than  r(n),  and  each  connected  component  of  H(£) 
has  length  at  least  r(n).  Let  uh{£)  be  the  number  of  connected  components 
of  H(£),  if  H {£)  is  empty,  then  we  take  the  convention  that  uh{£)  =  0. 
Clearly,  njj(£)  <  n.  We  claim  that,  if  nn{£)  >  0,  then  r  i— »  nn{£  +  t)  is 
non-increasing.  Let  d(£)  <  r(n)  be  the  distance  between  any  two  consecu¬ 
tive  agents  at  time  £.  Because  both  agents  move  in  the  same  direction,  a 
simple  calculation  shows  that 

d{£  +  1)  <  d{£)  +  kpTop(r  -  d{£))  =  (1  -  kpIop)d(£)  +  kpTopr(n) 

<  (1  ^prop)^”  T  kpr0pv(n )  —  r(n). 

This  means  that  the  two  agents  remain  within  distance  r(n),  and  therefore, 
connected  at  the  following  time  instant.  Because  the  number  of  connected 
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components  of  £gd.^r)(6^l\ . . .  ,9^)  does  not  increase,  it  follows  that  the 
number  of  connected  components  of  H  cannot  increase.  Next,  we  claim 
that,  if  n h{£)  >  0,  then  there  exists  r  >  i  such  that  n#(r)  <  nn{£)- 
By  contradiction,  assume  nn{£)  =  n#(r)  for  all  r  >  l.  Without  loss 
of  generality,  let  { 1 , . . . ,  m}  be  a  set  of  agents  with  the  properties  that 
distcc  (9^l\l),  0^+1l(^))  <  r(n),  for  i  E  {l,...,m},  that  9^l\l)  and  9^(£) 
belong  to  the  boundary  of  H(£),  and  that  there  is  no  other  set  with  the 
same  properties  and  more  agents.  (Note  that  this  implies  that  the  agents 
1, . . . ,  m  are  in  counterclockwise  order.)  One  can  show  that,  for  r  >  £, 

6»[1](r  +  1)  =  0[1](t)  -  fcpropr(n), 

6>M(t  +  1)  =  0W(t)  -  fcpropdistc(0W(r),0[*-1](r)), 

for  i  E  {2, . . . ,  m}.  If  we  consider  the  inter-agent  distances 

d(r)  =  (distcc(0[1](T),0l2](r)),...,distcc(0[m-1](r),0H(r)))  G  R^"1, 

then  the  previous  equations  can  be  rewritten  as 

d(r  "hi)  —  Tridm_i(fcprop,  1  &Prop;0)  g?(t)  -)-  r(7i)/uprope  i, 

where  the  linear  map  ( a,b,c )  i— »  Tridm_i (a,b,c)  E  is  defined 

in  Section  1.6.4.  This  is  a  discrete-time  affine  time-invariant  dynamical 
system  with  unique  equilibrium  point  r(n)lm_i.  By  construction,  the  initial 
condition  of  this  system  satisfies  ||d(0)  —  r(n) lm-i||2  <  r(n)y/m  —  1.  By 
Theorem  1.76(h)  in  Section  1.6.4,  for  rj i  E  ] 0, 1  [,  the  solution  r  i— >  d(r)  to 
this  system  reaches  a  ball  of  radius  rj \  centered  at  the  equilibrium  point  in 
time  0(m  log  m  +  log  (Here  we  used  the  fact  that  the  initial  condition 

of  this  system  is  bounded.)  In  turn,  this  implies  that  r  i— >  j  dj(r)  is 

larger  than  ( m  —  l)(r(n)  —  r/ 1)  in  time  0(m log m  +  logr/^-1).  We  are  now 
ready  to  find  the  contradiction  and  show  that  uh(t)  cannot  remain  equal  to 
nn(£)  for  all  time  r.  After  time  ©(mlogm  +  log??^1)  =  0{n  log  n  +  log?^1), 
we  have: 

nH{t) 

2-k  >  riH(£)r(n)  +  (r(n)  —  r]i)(mj  —  1) 

3= 1 

=  riH(£)r(n)  +  (n  -  nH(l))(r(n)  -  r) i)  =  riH(£)r) i  +  n(r(n)  -  ij i). 

Here  mi, . . .  ,mnH^  are  the  number  of  agents  in  each  isolated  group,  and 
each  connected  component  of  H{£)  has  length  at  least  r(n).  Now,  take 
r)i  =  ( nr(n )  —  27r)n_1  =  5{n)n~ 1,  and  the  contradiction  follows  from 

27T  >  nH{£)r/i  +  nr{n )  —  nr] \ 

=  nnitjf]  l  +  nr(n)  +  27r  —  nr(n )  =  n/f(f)?ji  +  27r. 

In  summary,  this  shows  that  the  number  of  connected  components  of  H{£) 
decreases  by  one  in  time  0(n\ogn  +  log?^-1)  =  0(n\ogn  +  log(rod(n)-1)). 

171 


“Distributed  Control  of  Robotic  Networks”  by  F.  Bullo,  J.  Cortes  and  S.  Martinez 
Copyright  ©  2006-2008.  Manuscript  under  contract.  This  version:  October  27,  2008 


DCRN  October  27,  2008 


Note  that  S  being  lower  bounded  implies  n<5(n)-1  =  O(n),  and  therefore, 
0{n\ogn  +  log(n<5(n)-1))  =  0(n log n).  Iterating  this  argument  n  times,  in 
time  0(n2  log  n)  the  set  H  will  become  empty.  At  that  time,  the  resulting 
network  will  obey  the  discrete-time  linear  time-invariant  dynamical  system: 

d(r  +  1)  =  Circn(£:pr0p,  1  -  kpm p,0)  d(r),  (3.6.2) 

where  the  linear  map  (a,  b,  c)  i— >  Circ n(a,b,c)  G  Mrixn  is  defined  in  Sec¬ 
tion  1.6.4.  Hered(r)  =  ( distcc(#W(T),  9^  (r)), . . . ,  distcc (#["■! (r),  0[n+1l(r)))  G 
R£0,  with  the  convention  0[n+1l  =  flW.  By  Theorem  1.76(iii)  in  Section  1.6.4, 
in  time  0{n2  logs-1),  the  error  2-norm  satisfies  the  contraction  inequality 
||d(r)  —  d*||2  <  e||d(0)  —  d* || 2 ,  for  d*  =  ^ln.  We  convert  this  inequality  on 
2-norms  into  an  appropriate  inequality  on  oo-norms  as  follows.  Note  that 
|| d(0)  —  d*||oo  =  nraxig{li  |dW(0)  —  d*^|  <  2ir.  For  772  G  ]0, 1[  and  for  r  of 
order  n2  log  rj^1, 

|| d(r)  -  d* ||oc  <  ||d(r)  -  d*||2  <  %||d(0)  -  d*||2 

<  mVn\\d(0)  ~  d*||oo  <  r]22TT\/n. 

This  means  that  the  desired  configuration  is  achieved  for  rj22iT^/ri  =  e,  that 
is,  in  time  0(n2  logr^1)  =  0(n2  log(ne^1)).  In  summary,  the  equidistance 
task  is  achieved  in  time  0(n2  log(ne-1)). 

STEP  4:  Finally,  we  prove  the  lower  bound  in  (ii).  As  we  reasoned 
before,  TC(7^ir,  CCAGREE  &  pursue)  is  negligible  as  compared  with  the  claimed 
lower  bound  estimate  for  TC(7^_eqdstnc,  CCagree  &  pursue),  and  therefore,  we 
assume  that  Tiir  has  been  achieved.  We  consider  an  initial  configuration 
with  the  properties  that  (i)  agents  are  counterclockwise-ordered  according 
to  their  unique  identifier,  (ii)  the  set  H( 0)  is  empty,  and  (iii)  the  inter-agent 
distances  d(0)  =  (distcc(#W(o),  0[21(O)), . . . ,  distcc(#[nl  (0),  (0)))  are  given 

by 

/x  27T  7t  —  e' , 

d{ 0)  =  - In  H - (vn  +  vn), 

n  n 

where  e'  G  ]tt,  0[  and  where  vn  is  the  eigenvector  of  Circn(/cprop,  1  —  kprop,  0) 
corresponding  to  the  eigenvalue  1  —  A:prop  +  kpmp  cos  (^)  —  kprop \J  —  1  sin  (^) 
(see  Section  1.6.4).  One  can  verify  that  vn+vn  =  2(1,  cos(27r/n), . . . ,  cos((n— 
l)27r/n))  and  that  ||vn  +  vn||2  =  \/2n.  In  turn,  this  implies  that  d(0)  G  M>0 
and  that  ||d(0)  —  ^ln||2  G  0{l/y/n).  Take  77 3  G  ] 0, 1  [.  The  argument  de¬ 
scribed  in  the  proof  of  Theorem  1.76(iii)  leads  to  the  following  statement: 
the  2-norm  of  the  difference  between  1 1— »  d(£)  and  the  desired  configuration 
^1  n  decreases  by  a  factor  773  in  time  of  order  n2  logry^"1 .  Given  an  initial 
error  of  order  0(l/y/n)  and  a  final  desired  error  of  order  e,  we  set  773  =  £y/n 
and  obtain  the  desired  result  that  it  takes  time  of  order  n2log(?re)_1  to 
reduce  the  2-norm  error,  and  therefore,  the  oo-norm  error  to  size  e.  This 
concludes  the  proof.  ■ 
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3.6.3  Proof  of  Theorem  3.34 

Proof.  Note  that  the  number  of  edges  in  5circie  is  at  most  0(n2)  as  it  is 
possible  that  all  robots  are  within  distance  r(n)  of  each  other.  The  upper 
bounds  in  (i)  and  (ii)  follow  then  from  inequality  (3.3.1)  and  Theorem  3.33. 
To  prove  the  lower  bounds  we  follow  the  steps  and  notation  in  the  proof  of 
Theorem  3.33.  Regarding  the  lower  bounds  in  (i),  we  examine  the  evolu¬ 
tion  of  the  initial  configuration  depicted  in  Figure  3.10.  From  STEP  2:  in 
the  proof  of  Theorem  3.33,  recall  that  the  time  it  takes  agent  1  to  receive 
the  message  with  max-id  =  n  is  lower  bounded  by  n / (kpropr(n))  —  1.  Our 
proof  strategy  is  to  lower  bound  the  number  of  edges  in  the  graph  until  this 
event  happens.  Note  that,  at  initial  time,  there  are  (n  —  l)2  edges  in  the 
communication  graph  of  the  network,  and  therefore,  (n  —  l)2  messages  get 
transmitted.  At  the  next  communication  round,  agent  1  has  moved  kpvopr(n) 
counterclockwise,  and  therefore,  the  number  of  edges  is  lower  bounded  by 
(n  —  2)2.  Iterating  this  reasoning,  we  see  that  after  i  <  n  /  (kpmpr(n))  com¬ 
munication  rounds,  the  number  of  edges  is  lower  bounded  by  (n  —  i)2 .  Now, 
if  S(n)  >  vr(l/fcprop  —  2),  then  n  >  vr/A:propr(n)),  and  therefore,  the  total 
communication  complexity  is  lower  bounded  by 


^  (n  —  i)2  £  fl(n2r(n)  1). 

i=  1 

On  the  other  hand,  if  6(n)  <  vr(l/A:prop— 2),  then  n  <  n /kpTopr(n)),  and  after 
n  time  steps,  we  lower  bound  the  number  of  edges  in  the  communication 
graph  by  the  number  of  edges  in  a  chain  of  length  n,  that  is,  n—  1.  Therefore, 
the  total  communication  complexity  is  lower  bounded  by 

n 

^(n  —  i)2  +  (n  —  1) (- - n  —  n)  £  f l(n3  +  nr(n)”1). 

.  ^  ''fcpropO^J 

The  two  lower  bounds  match  when  <5(n)  =  7r ( 1  / fcprop  —  2). 

Regarding  the  lower  bound  in  (ii) ,  we  consider  first  the  case  when  nu  (0)  = 
0.  In  this  case,  the  network  obeys  the  discrete-time  linear  time-invariant  dy¬ 
namical  system  (3.6.2).  Consider  the  initial  condition  d( 0)  that  we  adopted 
for  STEP  4: ■  We  know  it  takes  time  of  order  n2log(ne)_1  for  the  appro¬ 
priate  contraction  property  to  hold.  At  d(0),  the  maximal  inter-agent  dis¬ 
tance  is  (4-7t  —  e')/n  and  it  decreases  during  the  evolution.  Because  each 
robot  can  communicate  with  any  other  robot  within  a  distance  r(n),  the 
number  of  agents  within  communication  range  of  a  given  agent  is  of  order 
r(n)n/(47r  —  e'),  that  is,  of  order  5(n).  From  here  we  deduce  that  the  total 
communication  complexity  belongs  to  f i(n3d(n)  log(ne)-1).  ■ 
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3.7  EXERCISES 


E3.1  (Orientation  dynamics).  We  review  some  basic  kinematic  concepts  about  ori¬ 
entation  dynamics,  e.g.,  see  [Bullo  and  Lewis,  2004,  Spong  et  al.,  2006].  Define 
the  set  of  skew  symmetric  matrices  in  Rdxd  as 

so (d)  =  {S  £  Rdxd  |  S  =  —ST}. 


Let  x  denote  the  cross-product  on  R3  and  define  the  linear  mapC :  R3  — >  so (3)  by 
xy  =  x  x  y  for  all  y  £  R3. 

(i)  Show  that,  if  x  =  {x\,  X2,  *3),  then: 


(ii) 


0 

X3 


-X3  X2 
0  —Xl 


—X2  Xl  0 

Given  a  differentiable  curve  R  :  [0,T]  — >  SO(3),  show  that  there  exists  a 
curve  u>  :  [0,  T\  — >  R3  such  that 


R(t)  =  R{t)Q{t). 


These  two  results  lead  to  a  motion  model  of  a  relative  sensing  network  with 
time-varying  orientation.  Generalizing  the  constant-orientation  model  in  equa¬ 
tion  (3.2.2),  the  complete  position  and  orientation  dynamics  may  be  written  as 

PsIedO)  =  -RfiledO)  ui] > 

41dW  =  41dW^1, 

where,  for  i  £  {1, . . . ,  n},  and  luj1'  are  the  linear  and  the  body  angular  velocities 
of  robot  i,  respectively. 

E3.2  (Variation  of  agree  &  pursue  control  and  communication  law).  Consider 
the  AGREE  &  PURSUE  control  and  communication  law  defined  in  Section  3.1.3  with 
the  state  transition  function  replaced  by  the  following: 

function  stf($,  w,  y) 

1:  for  each  non-null  message  (#rcvd,  (dirrcvd, max-idrcva))  in  y  do 
2:  if  (max-idrCvd  >  max-id)  then 

3:  new-dir  :=  dirrcvd 

4:  new-id  :=  max-idrcva 

5:  return  (new-dir,  new-id) 

The  only  difference  between  this  law  and  the  AGREE  &  PURSUE  law  in  Sec¬ 
tion  3.1.3  is  that,  at  each  communication  round,  each  agent  picks  the  message 
with  the  largest  value  of  max-id  among  all  messages  received  (instead  of  among 
the  messages  received  only  from  agents  moving  towards  its  position).  We  refer  to 
this  law  as  mod-agree  &  pursue. 

Consider  the  direction  agreement  task  Tm  :  (S1)"  x  Wn  — *  {true,  false} 
defined  in  Example  3.22.  Assume  dir^  (0)  =  c,  and  let  k  £  {1, . . . ,  n  —  1}  be  the 
largest  identity  such  that  dir^(0)  =  cc.  Do  the  following  tasks: 

(i)  Show  that,  if  the  message  from  agent  k  gets  delivered  to  agents  clockwise- 
placed  with  respect  to  agent  k  along  two  consecutive  communication 
rounds,  then  the  message  from  agent  k  has  traveled  at  least  (l  —  kprop)r(n) 
along  the  circle  in  the  clockwise  direction; 
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(ii)  show  that,  if  distcc(0^n^  (0),  0^1  (0))  <  2  r(n),  then 


TC  (Tdir,CC  MOD- AGREE  &  PURSUE  >  %Q  j  ^0  ) 


0(r(n)  *); 


(iii)  implement  the  algorithm  in  your  favorite  simulation  software  (e.g.,  Math- 
ematica,  Matlab,  or  Maple),  and  compute  the  time  complexity  of  multi¬ 
ple  executions  of  the  algorithm  starting  from  different  initial  conditions. 
Does  your  simulation  analysis  support  the  conjecture  that 


TC  (Tdir,CC 


MOD-AGREE  &  PURSUE 


) 


O (r(n)  *)? 


For  the  simulation  analysis  to  be  relevant,  you  should  use  a  large  number 
of  randomly  generated  initial  physical  positions  and  processor  states. 

E3.3  (Leader- following  flocking).  Consider  a  group  of  robots  moving  in  R2  accord¬ 
ing  to  the  following  discrete-time  version  of  the  planar  vehicle  dynamics  introduced 
in  Example  3.1 


x[£  +  1)  =  x(£)  +  v  cos (9(£)), 
y(l  +  1)  =  y(l)  +  v  sin  (0(f)), 

0(£  +  1)  ?=  0{£)  +  uj. 

We  let  {(p^ , #W), . . . ,  (p^,0^)}  denote  the  robot  physical  states,  where  p^  = 
(x^\  pW)  G  R2  corresponds  to  the  position  and  G  [0,  27t)  corresponds  to  the 
orientation  of  the  robot  i  G  I.  As  communication  graph,  we  adopt  the  r-disk 
graph  Sdisk(r)  introduced  in  Section  2.2. 

Assume  all  agents  move  at  unit  speed,  v  =  1,  and  update  their  heading  accord¬ 
ing  to  the  leader-following  version  of  Vicsek’s  model  (see  equation  (1.6.5)): 

6ll](£+  1)  =  6[1](£),  (E3.1) 

0[ii(l+  1)  =  avrg({0w(f)}U{0M(£)  |  j  s.t.  ||pw(f)  -  p[i](f)||2  <  r}), 

for  i  G  {2, . . . ,  n}.  Do  the  following  tasks: 

(i)  Write  the  algorithm  formally  as  a  control  and  communication  law  as 
defined  in  Section  3.1.2; 

(ii)  given  initial  conditions  for  the  position  and  orientation  of  the  robots, 
express  (E3.1)  as  the  time-dependent  linear  iteration  associated  to  a  se¬ 
quence  of  matrices  {F(£)  \  £  G  Z>o}.  Are  these  matrices  stochastic?  Are 
they  symmetric?  Is  the  sequence  non-degenerate?;  and 

(iii)  we  loosely  define  the  flocking  task  as  achieving  agreement  on  the  heading 
of  the  agents.  Using  Theorem  1.60,  identify  connectivity  conditions  on 
the  sequence  of  graphs  determined  by  the  evolution  of  the  network  that 
guarantee  that  agents  achieve  flocking.  What  is  the  final  orientation  in 
which  the  network  flocks? 
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Chapter  Four 

Connectivity  maintenance  and  rendezvous 


The  aims  of  this  chapter  are  twofold.  First,  we  introduce  the  rendezvous 
problem  and  analyze  various  coordination  algorithms  that  achieve  it,  provid¬ 
ing  upper  and  lower  bounds  on  their  time  complexity.  Second  we  introduce 
the  problem  of  maintaining  connectivity  among  a  group  of  mobile  robots 
and  use  geometric  approaches  to  preserve  this  topological  property  of  the 
network. 

Loosely  speaking,  the  rendezvous  objective  is  to  achieve  agreement  over 
the  physical  location  of  as  many  robots  as  possible,  i.e.,  to  steer  the  robots 
to  a  common  location.  This  objective  is  to  be  achieved  with  the  limited 
information  flow  described  in  the  model  of  the  network.  Typically,  it  will 
be  impossible  to  solve  the  rendezvous  problem  for  all  robots  if  the  robots 
are  placed  in  such  a  way  that  they  do  not  form  a  connected  communication 
graph.  Therefore,  it  is  reasonable  to  assume  that  the  network  is  connected 
at  initial  time  and  that  a  good  property  of  any  rendezvous  algorithm  is  that 
of  maintaining  some  form  of  connectivity  among  robots.  This  discussion 
motivates  the  connectivity  maintenance  problem.  Once  a  model  for  when 
two  robots  can  acquire  each  other’s  relative  position  is  adopted,  this  problem 
is  of  particular  relevance  as  the  inter-robot  topology  depends  on  the  physical 
states  of  the  robots.  Our  exposition  here  is  mainly  based  on  [Ando  et  ah, 
1999,  Cortes  et  ah,  2006,  Ganguli  et  ah,  2007b], 

The  chapter  is  organized  as  follows.  In  the  first  section  of  the  chapter,  we 
formally  introduce  the  two  coordination  problems.  In  the  second  section, 
we  define  various  connectivity  constraint  sets  to  limit  the  robot  motion  in 
order  to  maintain  network  connectivity.  This  allows  us  to  study  in  the 
next  section  various  rendezvous  algorithms  with  connectivity  maintenance 
properties.  We  study  the  circumcenter  algorithm  and  characterize  its  com¬ 
plexity.  Additionally,  we  introduce  the  perimeter  minimizing  algorithm  for 
nonconvex  environments.  The  fourth  section  presents  various  simulations 
of  the  proposed  motion  coordination  algorithms.  We  end  the  chapter  with 
three  sections  on,  respectively,  bibliographical  notes,  proofs  of  the  results 
presented  in  the  chapter,  and  exercises.  Our  technical  treatment  is  based  on 
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the  LaSalle  Invariance  Principle  and  on  geometric  tools  such  as  proximity 
graphs  and  robust  visibility. 


4.1  PROBLEM  STATEMENT 

We  here  review  the  class  of  networks  and  the  types  of  problems  that  will  be 
considered  in  the  chapter. 


4.1.1  Networks  with  discrete-time  motion 

Along  the  chapter,  we  will  consider  the  robotic  networks  5disk;  5ld  and 
^oo-diskj  and  the  relative-sensing  networks  S™isk  and  ^is-disk  presented  in 
Example  3.4  and  in  Section  3.2.2. 

For  the  robotic  networks  5disk,  5ld  and  5oo-disk>  we  will,  however,  assume 
that  the  robots  move  in  discrete  time,  that  is,  we  adopt  the  discrete-time 
motion  model 

p®(£  +  l)=pM(£)  +  u®(£),  *€{l,...,n}.  (4.1.1) 

Similarly,  for  the  relative-sensing  networks  5^isk  an<4  ^vis-disk  >  we  adopt  the 
discrete-time  motion  model 

P& ced(£  +  !)  =  PfixedW  +  G  {1,  ■  ■  •  ,  n}-  (4-L2) 

As  an  aside,  if  we  express  the  previous  equation  with  respect  to  frame  i  at 
time  t,  then  equation  (4.1.2)  reads 

P(frame  i  at  time  t)  +  1 )  =  “(flame  i  at  time  i  G  { 1,  .  .  .  ,  n}. 

We  present  the  treatment  in  discrete-time  for  simplicity.  It  is  easy  to  show 
that  any  control  law  for  the  discrete-time  motion  model  can  be  implemented 
in  the  continuous-time  networks.  We  usually  assume  no  bound  on  the  con¬ 
trol  or  rtmax  as  magnitude  bound  and  we  explicitly  state  when  we  instead 
introduce  the  bound. 


4.1.2  The  rendezvous  task 

Next,  we  discuss  the  rendezvous  problem.  There  are  different  ways  of  formu¬ 
lating  this  objective  in  terms  of  task  maps.  Let  S  =  ({1, . . . ,  n},  77,  Ecmm) 
be  a  uniform  robotic  network.  The  (exact)  rendezvous  task  Tv ndZvs  :  Xn  — > 
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{true,  false}  for  S  is  the  coordination  task  defined  by: 


^ndzvS(a:[1I,---,a:[nI) 

f true,  if  =  x^\  for  all  ( i,j )  G  ECUim{x^\  . . . ,  x M), 

1  false,  otherwise. 

Next,  assume  that,  for  the  same  network  S  =  ({1 . n},  1Z,  Er mm),  the 

robots’  physical  state  space  is  X  C  M.d.  It  is  convenient  to  review  some 
basic  notation  consistent  with  what  we  adopted  in  Chapter  2.  We  let  V  = 
{pW, . . .  ,p^}  denote  the  set  of  agents  location  in  X  C  M.d  and  P  be  an 
array  of  n  points  in  Wl.  Furthermore,  we  let  avrg  denote  the  average  of  a 
finite  point  set  in  M.d,  that  is, 

avrg({gi, . . . ,  qk })  =  ^(<Ji  4 - b  qk )• 

For  e  G  M>o,  the  e -rendezvous  task  T£_  rndzvs  =  (Md)n  — >  {true,  false}  for 
S  is  dehned  as  follows:  X-rndzvs  is  true  at  P  if  each  robot  position  pW,  for 
i  G  {1, . . . ,  to},  is  at  distance  less  than  e  from  the  average  position  of  its 
-Ey-mm-neighbors.  Formally, 

^e-rndzvs  (P)  —  true 

<*=>  \\P[t]  -avrg  ({p^J  |  (i,j)  G  Scmm(F,)})||2  <  £,  *G{l,...,n}. 


4.1.3  The  connectivity  maintenance  problem 

Assume  the  communication  graph,  computed  as  a  function  of  the  robot 
positions,  is  connected:  how  should  the  robots  move  in  such  a  way  that 
their  communication  graph  is  again  connected.  Clearly,  the  problem  depends 
upon  (1)  how  do  the  robots  move,  and  (2)  what  proximity  graph  describes 
the  communication  graph  or,  in  the  case  of  relative-sensing  networks,  what 
sensor  model  is  available  on  each  robot. 

The  key  idea  is  to  restrict  the  allowable  motion  of  each  agent.  Different 
motion  constraint  sets  correspond  to  different  communication  or  sensing 
graphs.  We  have  three  objectives  in  doing  so.  First,  we  aim  to  achieve 
this  objective  only  based  on  local  measurements  or  1-hop  communication, 
i.e. ,  without  introducing  processor  states  explicitly  dedicated  to  this  task. 
Second,  the  constraint  sets  should  depend  continuously  on  the  position  of 
the  robots.  Third,  we  have  the  somehow  informal  objective  to  design  the 
constraint  sets  as  “large”  as  possible  so  as  to  minimally  constrain  the  motion 
of  the  robots. 
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4.2  CONNECTIVITY  MAINTENANCE  ALGORITHMS 

In  this  section  we  present  some  algorithms  that  might  be  used  by  a  robotic 
network  to  maintain  communication  connectivity.  The  results  presented  in 
this  section  start  with  the  original  idea  introduced  by  Ando  et  al.  [1999]  for 
first-order  robots  communicating  along  the  edges  of  a  disk  graph,  that  is, 
for  the  network  described  in  Example  3.4.  This  idea  is  then  generalized  to 
a  number  of  useful  settings.  The  properties  of  proximity  graphs  presented 
in  Section  2.2  play  a  key  role  in  formulating  and  solving  the  connectivity 
problem. 


4.2.1  Enforcing  range-limited  links 

First,  we  aim  to  constrain  the  motion  of  two  first-order  agents  in  order  to 
maintain  a  communication  link  between  them.  We  assume  that  the  commu¬ 
nication  takes  place  over  the  disk  graph  f/disk(r)  with  communication  range 
r  >  0. 

Loosely  stated,  the  pairwise  connectivity  maintenance  problem  is  as  fol¬ 
lows:  given  two  neighbors  in  the  proximity  graph  t/disk(r))  find  a  rich  set  of 
control  inputs  for  both  agents  with  the  property  that,  after  moving,  both 
agents  are  again  within  distance  r.  We  provide  a  solution  to  this  problem 
as  follows. 

Definition  4.1  (Pairwise  connectivity  constraint  set).  Consider  two 
agents  i  and  j  at  positions  p^  £  and  p^  £  such  that  ||pM  —  p^‘  ||2  <  r. 
The  connectivity  constraint  set  of  agent  i  with  respect  to  agent  j  is 

Note  that  both  robots  i  and  j  can  independently  compute  their  respective 
connectivity  constraint  sets.  The  proof  of  the  following  result  is  straightfor¬ 
ward. 

Lemma  4.2  (Maintaining  pairwise  connectivity).  Assume  that  at  time  £ 
the  distance  between  agents  pM  and  p ^  is  no  more  than  r.  If  the  control 
u^(£)  takes  value  in 

u[Hi)  £  Tdisk (p[il(£),p[%)) -p[H^  =  B{plj]{n~p[t]{£\7-), 

and,  similarly,  u^(£)  £  Adisk (p^  (£),p^  (£))  —  p^(£),  then,  according  to  the 
discrete-time  motion  model  (4.1.1) 
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(i)  both  agents  positions  at  time  £  +  1  are  inside  Rdisk(p^  {£),P^  {£)) / 
and 

(ii)  the  distance  between  the  agents  positions  at  time  £  +  1  is  no  more 
than  r. 

We  illustrate  these  pairwise  connectivity  maintenance  concepts  in  Fig¬ 
ure  4.1. 


Figure  4.1  Illustration  of  the  connectivity  maintenance  constraint.  Starting  from  positions 
pW  and  the  robots  are  restricted  to  move  inside  the  disk  centered  at 


dldisk (pW,PM)  =  |(pW  +  PM)  with  radius  §. 


Remark  4.3  (Constraints  for  relative-sensing  networks).  Let  us  con¬ 
sider  a  relative-sensing  network  with  a  disk  sensor  of  radius  r;  see  Exam¬ 
ple  3.15.  Recall  the  following  facts  about  this  model:  First,  agent  i  measures 
the  position  of  robot  j  in  its  frame  ,  that  is,  robot  i  measures  p^K  Sec¬ 
ond,  p^  =  Od-  Third,  if  W  C  Md,  then  Wt  denotes  its  expression  in  the 
frame  E M .  Combining  these  notions  and  assuming  the  inter-agent  distance 
is  no  more  than  r,  the  pairwise  connectivity  constraint  set  in  Definition  4.1 
satisfies: 

[ j] 

(^disk (p[l\pb]))  ,  =  Tdisk(Od,pJjl)  =  b(^~,  0.  • 


4.2.2  Enforcing  network  connectivity 

Here,  we  focus  on  how  to  constrain  the  mobility  of  multiple  agents  in  order 
to  maintain  connectivity  for  the  entire  network  that  they  form.  We  again 
consider  the  case  of  first-order  agents  moving  according  to  the  discrete-time 
equation  (4.1.1)  and  communicating  over  f/disk(r)- 
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Loosely  stated,  the  network  connectivity  maintenance  problem  is  as  fol¬ 
lows:  Given  n  agents  at  positions  V(£)  =  {pW(£),  ■  •  • , p^ (£)}  in  which  they 
form  a  connected  r-disk  graph  <7disk(0>  the  objective  is  to  find  a  rich  set 
of  control  inputs  for  all  agents  with  the  property  that,  at  time  l  +  1,  the 
agents’  new  positions  V{£  +  1)  form  again  a  connected  r-disk  graph  t/diskM- 
We  provide  a  simple,  but  potentially  conservative,  solution  to  this  problem 
as  follows. 

Definition  4.4  (Connectivity  constraint  set).  Consider  a  group  of  agents 
at  positions  V  =  {pM, . . .  ,  p[nl}  C  The  connectivity  constraint  set  of 
agent  i  with  respect  to  V  is 

Xdisk(pb],V)  =  P|  {Tdisk(pW,g)  |  q  €  P\{p[,|J  s.t.  \\q-p[l]h  <  r}.  • 


In  other  words,  if  are  agents  positions  whose  distance  from  pH 

is  no  more  than  r,  then  the  connectivity  constraint  set  for  agent  i  is  the 
intersection  of  the  constraint  sets  i?p(rp,  +p^),  5)  for  k  £  {1, . . . ,  Z};  see 
Figure  4.2. 


Figure  4.2  Illustration  of  network  connectivity  maintenance.  The  connectivity  Adisk- 
constraint  set  of  the  white-colored  agent  is  the  intersection  of  the  individual 
constraint  sets  determined  by  its  neighbors. 


The  following  result  is  a  consequence  of  Lemma  4.2. 

Lemma  4.5  (Maintaining  network  connectivity).  Consider  a  group 
of  agents  at  positions  V{£)  =  (pW(^), . . . , pM(£)}  c  Rd  at  time  £.  If  each 
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agent’s  control  (l)  takes  value  in 

u®(£)  G  Xdisk(p^(£),V(£))  ~p®(£),  i  <E  {1,  •  •  •  ,n}, 

then,  according  to  the  discrete-time  motion  model  (4.1.1) 

(i)  each  agent  remains  in  its  connectivity  constraint  set,  that  is,  p^(£-\- 
1)  €  *disk 

(ii)  each  edge  of  Udisk(0  at  V(£)  is  maintained  after  the  motion  step, 
i.e.,  if  ||p^(^)  —  P^{£)\\2  <  r,  then  also  \\p^{£-\-l)  —  p^\£+l)\\2  <  r; 

(Hi)  if  1/disk  M  at  time  £  is  connected,  then  Udisk(0  at  time  £  +  1  is 
connected;  and 

(iv)  the  number  of  connected  components  of  the  graph  Udisk(0  at  time 
£+1  is  equal  to  or  smaller  than  the  number  of  connected  components 
of  the  graph  UdiskM  at  time  £. 

Remark  4.6  (Constraints  for  relative-sensing  networks:  cont’d). 

Following  up  on  Remark  4.3,  the  connectivity  constraint  set  in  Definition  4.4, 
written  in  the  frame  SW,  is 

*disk(0d,  {pf\---,p[p]}) 

\j] 

= n 0 1  d  ^ *  such that  -p[t]h  <r}-  • 

Next,  we  relax  the  constraints  in  Definition  4.4  to  provide  the  network 
nodes  with  larger,  and  therefore  less  conservative,  motion  constraint  sets. 
Recall  from  Section  2.2  the  relative  neighborhood  graph  Ur,Nj  the  Gabriel 
graph  Qq,  and  the  r-limited  Delaunay  graph  Uld(0-  These  proximity  graphs 
are  illustrated  in  Figure  2.8.  From  Theorem  2.9  and  Proposition  2.10,  re¬ 
spectively,  recall  that  the  proximity  graphs  Urn  FI  Udisk(0>  Ug  FI  Udisk(0  and 
Uld(?")  have  the  following  properties: 

(i)  They  have  the  same  connected  components  as  Udisk(0>  that  is,  for 
all  point  sets  V  C  Md,  all  graphs  have  the  same  number  of  connected 
components  consisting  of  the  same  vertices;  and 

(ii)  they  are  spatially  distributed  over  1/disk  (r). 

These  mathematical  facts  have  two  implications.  First,  to  maintain  or  de¬ 
crease  the  number  of  connected  components  of  a  disk  graph,  it  is  sufficient  to 
maintain  or  decrease  the  number  of  connected  components  of  any  of  the  three 
proximity  graphs  Urn  Fl  Udisk(0>  Ug  FI  Udisk(0  and  Uld(0-  Because  each  of 
these  graphs  is  more  sparse  than  the  disk  graph,  i.e.,  they  are  subgraphs 
of  Udisk(r)>  fewer  connectivity  constraints  need  to  be  imposed.  Second,  be¬ 
cause  these  proximity  graphs  are  spatially  distributed  over  the  disk  graph, 
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it  is  possible  for  each  agent  to  determine  which  of  its  neighbors  in  t/disk(r) 
are  its  neighbors  also  in  these  subgraphs.  We  formalize  this  discussion  as 
follows. 

Definition  4.7  ((/-connectivity  constraint  set).  Let  G  be  a  proxim¬ 
ity  graph  that  is  spatially  distributed  over  (/diskM  and  that  has  the  same 
connected  components  as  (/diskM-  Consider  a  group  of  agents  at  positions 
V  =  {pM, . . .  ,£>[”]}  C  Mrf.  The  G- connectivity  constraint  set  of  agent  i  with 
respect  to  V  is 

^disk ,g(jp^\v)  =  Pi  {-Tdisk (p^,q)  |  q  e  V  s.t.  (g,pW)  is  an  edge  of  Q{V)}. 


Lemma  4.8  (Maintaining  connectivity  of  sparser  networks).  Let  G 

be  a  proximity  graph  that  is  spatially  distributed  over  ^disk(^)  and  that  has 
the  same  connected  components  as  f/diskM-  Consider  a  group  of  agents  at 
positions  V(£)  =  {pW(£),  •  •  •  ,p^(^)}  C  M.d  at  time  £.  If  each  agent’s  control 
u^\l)  takes  value  in 

«W(0  €  XdisKg{p[i](t),V(e)) i  G  {1, . . .  ,n}, 
then,  according  to  the  discrete-time  motion  model  (4.1.1) 

(i)  each  agent  remains  in  its  G -connectivity  constraint  set; 

(ii)  two  agents  that  are  in  the  same  connected  component  of  G  remain 
at  the  same  connected  component  after  the  motion  step;  and 

(in)  the  number  of  connected  components  of  the  graph  G  at  V(£  +  1)  is 
equal  to  or  smaller  than  the  number  of  connected  components  of  the 
graph  G  at  V  (£) . 

The  reader  is  asked  to  provide  a  proof  of  this  result  in  Exercise  E4.1. 


4.2.3  Enforcing  range-limited  line-of-sight  links  and  network  connectivity 

Here,  we  consider  the  connectivity  maintenance  problem  for  a  group  of 
agents  with  range-limited  line-of-sight  communication,  as  described  in  Ex¬ 
ample  3.6.  It  is  convenient  to  treat  directly  and  only  the  case  of  a  com¬ 
pact  allowable  nonconvex  environment  Q  C  R2  contracted  into  Qs  =  {q  £ 
Q  |  dist(g,  dQ)  >  (5}  for  a  small  positive  5.  We  present  a  solution  based  on 
designing  constraint  sets  that  guarantee  that  every  edge  of  the  range- limited 
visibility  graph  ^vis_disk,Q5  is  preserved. 

We  begin  with  a  useful  observation  and  a  corresponding  geometric  algo- 
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rithm.  Assume  that,  at  time  £,  robot  j  is  inside  the  range-limited  visibility 
set  from  pW  in  Qs,  that  is,  with  the  notation  of  Section  2.1.2, 

Plj](£)  G  Vidisk(p[i] (£);Qs)  =  Vi(pW(ty,Qs)nB(pW(£),r). 

This  property  holds  also  at  time  1+  1  if  ||pM(f  +  1)  —  p^ (£  +  1) || 2  <  r  and 
[pM(f  +  1  ),p^(£  +  1)]  C  Qs-  A  sufficient  condition  is  therefore  that 

pW(£+l),  p[j](£+  1)  €  A, 

for  some  convex  subset  A  of  Qs  0  £>Q(pM(f)  +p (^)),  |) .  Intuitively  speak¬ 
ing,  X  plays  the  role  of  A-constraint  set  for  the  proximity  graph  ^vis-disk, Qs  ■ 
The  following  geometric  algorithm,  given  the  positions  pH  and  p in  an 
environment  Qs,  computes  precisely  one  such  convex  subset: 

function  ITERATED  TRUNCATION  (pM  ,  p®;Qs) 

%  Executed  by  robot  i  at  position  pW  assuming  that  robot  j  is  at  position 
pd  1  within  range-limited  line  of  sight  of  pM 

1:  Atemp  :=  Vidisk(pM ;  Qs)  D  B  (i  (pW  +pW),  0 

2:  while  9Atemp  contains  a  concavity  do 

3:  v  :=  a  strictly  concave  point  of  dAtemp  closest  to  [pW,pW] 

4:  Atemp  . —  Atemp  H  Hqs  (u) 

5:  return  Atemp 

Note:  in  step  3:  multiple  points  belonging  to  distinct  concavities  may 
satisfy  the  required  property.  If  so,  v  may  be  chosen  as  any  of  them. 

Figure  4.3(b)  illustrates  an  example  convex  constraint  set  computed  by 
the  iterated  truncation  algorithm.  Figure  4.4  illustrates  the  step-by- 
step  execution  required  to  generate  Figure  4.3(b). 

Next,  we  characterize  the  main  properties  of  the  iterated  truncation 
algorithm.  It  is  convenient  to  define  the  set 

J  =  {(p,  q)  G  Qs  X  Qs  |  \p,  q]  G  <2,5  and  ||p  -  q\ |2  <  r}. 

Proposition  4.9  (Properties  of  the  iterated  truncation  algorithm). 

Consider  the  5-contraction  of  a  compact  allowable  environment  Qs  with  k 
strict  concavities,  and  let  (pH ,  pM )  £  J.  The  following  statements  hold: 

(i)  The  iterated  truncation  algorithm,  invoked  with  arguments 
C p[i]  ,p^;Qs),  terminates  in  at  most  k  steps;  denote  its  output  by 

Avis-disk  (0 1\  p[j] ;  Qs); 

(ii) 

‘^•vis-disk  (p[<1  ,  pM;  Qs)  is  nonempty,  compact  and  convex; 

(in)  Avis.disk(pW , p^l ;  Qs)  =  Avis.disk(pW,pW;  Qs);  and 

(iv)  the  set-valued  map  (p,q)  >— >•  Avig_disk(p,  q;  Qs)  is  closed  at  all  ( p,q )  E 

J. 
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Figure  4.3  The  plot  in  (a)  shows  the  set  Vidisk(p^;  Qs)  (~l  +p^),  §).  The  plot  in 

(b)  shows  the  outcome  of  the  execution  of  the  iterated  truncation  algo¬ 
rithm.  Robots  i  and  j  are  constrained  to  remain  inside  the  shaded  region  in 
(b) ,  which  is  a  convex  subset  of  Qs  and  of  the  closed  ball  with  center  |  (p^  +p^ ) 
and  radius 


Figure  4.4  From  left  to  right,  a  sample  run  of  the  iterated  truncation  algorithm. 

The  initial  set  Ttemp  :=  Vidisk(pW ;  Qs)  fl  B(§(pW  +p[il),  §)  is  shown  in  Fig¬ 
ure  4.3(a).  The  lightly  and  darkly  shaded  regions  together  represent  Atemp 
at  the  current  iteration.  The  darkly  shaded  region  represents  Atemp  l~l  Hqs(v), 
where  v  is  as  described  in  step  3 : .  The  outcome  of  the  execution  is  shown  in 
Figure  4.3(b). 
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In  the  interest  of  brevity  we  do  not  include  the  proof  here  and  instead 
refer  the  reader  to  [Ganguli  et  al.,  2007b].  We  just  mention  that  fact  (iii)  is 
a  consequence  of  the  fact  that  all  relevant  concavities  in  the  computation  of 
7tvis-disk(p^ ,  pH ;  Qs)  are  visible  from  both  agents  pM  and  p H .  We  are  finally 
ready  to  provide  analogs  of  Definition  4.4  and  Lemma  4.5. 

Definition  4.10  (Line-of-sight  connectivity  constraint  set).  Consider 
a  nonconvex  allowable  environment  Qs  and  two  agents  i  and  j  within  range- 
limited  line  of  sight.  We  call  fbvis_rjisk (pM ,  pH ;  Qs)  the  pairwise  line-of-sight 
connectivity  constraint  set  of  agent  i  with  respect  to  agent  j.  Furthermore, 
given  agents  at  positions  V  =  {pM, . . .  ,  pH}  C  Qs  that  are  all  within  range- 
limited  line  of  sight  of  agent  i,  the  line-of-sight  connectivity  constraint  sets 
of  agent  i  with  respect  to  V  is 

^vis-disk  (P®,V-,QS)  =  m  ^vis-disk  |geP\{pw}}. 


The  following  result  is  a  consequence  of  Proposition  4.9. 

Lemma  4.11  (Maintaining  network  line-of-sight  connectivity).  Con¬ 
sider  a  group  of  agents  at  positions  V(t)  =  {pH  (£),..., pH  (£)}  c  Qs  at  time 
£.  If  each  agent’s  control  v$(£)  takes  value  in 

u®(£)  G  TVis-disk (p®(*),'P(*);Qs) - p[i](£ ),  i  G  {1, . . . ,  n}, 

then,  according  to  the  discrete-time  motion  model  (4.1.1) 

(i)  each  agent  remains  in  its  constraint  set,  that  is, 

p®(£  +  i)  €  AWdisk^W, 

(ii)  each  edge  of  5vis-disk,Qs  at  V(£)  is  maintained  after  the  motion  step, 
i.e.,  if  p^  and  pH  are  within  range-limited  line  of  sight  at  time  £, 
then  they  are  so  also  at  time  £  -\-  1; 

(in)  if  Gvis-disk,Q6  at  V{£)  is  connected,  then  £vis-disk ,qs  at  V{£  +  1)  is 
connected;  and 

(iv)  the  number  of  connected  components  of  the  graph  ^viS-disk,Q5  atV(£+ 

1)  is  equal  to  or  smaller  than  the  number  of  connected  components 
of  the  graph  ^vis-disk.Q*  at  V{£). 

Remark  4.12  (Constraints  for  relative-sensing  networks:  cont’d). 

Following  up  on  Remarks  4.3  and  4.6,  we  consider  a  relative-sensing  network 
with  range- limited  visibility  sensors,  see  Example  3.16.  To  compute  the 
connectivity  constraint  set  for  this  network,  it  suffices  to  provide  a  relative 
sensing  version  of  the  iterated  truncation  ALGORITHM: 

function  RELATIVE-SENSING  ITERATED  TRUNCATION^;  yenv) 
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%  Executed  by  robot  i  with  range-limited  visibility  sensor: 
robot  measurement  is  y  =  p^  E  Vidisk(02!  (Qs)i)  for  j  7^  * 
environment  measurement  is  yenv  =  Vidisk(02;  (Qs)i) 

1 :  ^temp  • =  Venv  “2“  ?  2  ) 

2:  while  ()Aiemp  contains  a  concavity  do 

3:  v  :=  a  strictly  concave  point  of  <9-Hemp  closest  to  [02,y] 

4:  ddemp  -=  -Hemp  O  Hymv  [V ) 

5:  return  -Hemp 

The  algorithm  output  is  <Tvis_disk(0d,  y),  for  y  =  G  Vidisk(02;  (Qa)*)-  • 

Next,  we  relax  the  constraints  in  Definition  4.10  to  provide  the  network 
nodes  with  larger,  and  therefore  less  conservative,  motion  constraint  sets. 
Similarly  to  Section  4.2.2,  we  seek  to  enforce  the  preservation  of  a  fewer 
number  of  range-limited  line-of-sight  links  while  still  making  sure  that  the 
overall  network  connectivity  is  preserved.  To  do  this,  we  recall  from  Sec¬ 
tion  2.2  the  notion  of  locally-cliqueless  graph  Q\cg  of  a  proximity  graph  Q . 
This  proximity  graph  is  illustrated  in  Figure  2.11.  Let  us  use  the  short¬ 
hand  notation  £ic-vis-disk,Q5  =  £lc,Svia.dlsk,Qi5  •  From  Theorem  2.12(h)  and  (iii), 
respectively,  recall  that  ^ic-vis-disk,Qa  has  the  following  properties: 

(i)  It  has  the  same  connected  components  as  ^vis-disk, Qs ,  that  is,  for  all 
point  sets  V  C  Md,  the  graph  has  the  same  number  of  connected 
components  consisting  of  the  same  vertices;  and 

(ii)  it  is  spatially  distributed  over  5vis-disk,Qa  • 

Because  of  (i),  to  maintain  or  decrease  the  number  of  connected  components 
of  a  range-limited  visibility  graph,  it  is  sufficient  to  maintain  or  decrease  the 
number  of  connected  components  of  the  sparser  graph  f/ic-vis-disk,Q5  •  Because 
of  (ii),  it  is  possible  for  each  agent  to  determine  which  of  its  neighbors  in 
^vis-disk, Qs  are  its  neighbors  also  in  ^ic-vis-disk,Qs  •  We  formalize  this  discussion 
as  follows. 

Definition  4.13  (Locally-cliqueless  line-of-sight  connectivity  con¬ 
straint  set).  Consider  a  nonconvex  allowable  environment  Qs  C  K2  and  a 
group  of  agents  at  positions  V  =  {pW, . . .  ,p C  Q.  The  locally-cliqueless 
line-of-sight  connectivity  constraint  set  of  agent  i  with  respect  to  V  is 

“He- vis-disk  (P^  j  T* ,  Q S )  —  {  ■*'  E  “Hris-disk  (P  -  (h  Qk  )  | 

qeV  s.t.  (<?,pW)  is  an  edge  of  £ic-vis-disk,Q5 CP)}-  • 


The  following  result  is  a  direct  consequence  of  the  previous  arguments. 

Lemma  4.14  (Maintaining  connectivity  of  sparser  networks).  Con- 

188 


“Distributed  Control  of  Robotic  Networks”  by  F.  Bullo,  J.  Cortes  and  S.  Martinez 
Copyright  ©  2006-2008.  Manuscript  under  contract.  This  version:  October  27,  2008 


DCRN  October  27,  2008 


sider  a  group  of  agents  at  positions  V{t)  =  {pW  (£),...,  pM  (£)}  c  Q5  at  time 
I.  If  each  agent’s  control  takes  value  in 

u^(l)  E  Xic-yis-d^{p[{]^),V(£fQs)  -Pw(€),  z  E  {1, . . .  ,n}, 
then,  according  to  the  discrete-time  motion  model  (4.1.1) 

(i)  each  agent  remains  in  its  locally- cliqueless  line-of-sight  connectivity 
constraint  set; 

(ii)  two  agents  that  are  in  the  same  connected  component  of  (?ic-vis-disk,Qa 
remain  at  the  same  connected  component  after  the  motion  step;  and 

(Hi)  the  number  of  connected  components  of  the  graph  (/ic-vis-disk,Q5  at 
V(£  +  1)  is  equal  to  or  smaller  than  the  number  of  connected  com¬ 
ponents  of  the  graph  ^lc-vis-disk.Q^  at  V{£). 

4.3  RENDEZVOUS  ALGORITHMS 

In  this  section  we  present  some  algorithms  that  might  be  used  by  a  robotic 
network  to  achieve  rendezvous.  Throughout  the  section  we  mainly  focus  on 
the  uniform  network  5disk  of  locally-connected  first-order  agents  in  this 
robotic  network  was  introduced  in  Example  3.4. 


4.3.1  Averaging  control  and  communication  law 

We  first  study  a  behavior  in  which  agents  move  towards  a  position  computed 
as  the  average  of  the  received  messages.  This  law  is  related  to  the  distributed 
linear  algorithms  discussed  in  Section  1.6  and,  in  particular,  to  adjacency- 
based  agreement  algorithms  and  the  Vicsek’s  model.  This  algorithm  has  also 
been  studied  in  the  context  of  “opinion  dynamics  under  bounded  confidence” 
and  is  known  in  the  literature  as  the  Krause  model. 

We  loosely  describe  the  averaging  law,  that  we  denote  by  CCAVBraging> 
as  follows: 


[Informal  description]  At  each  communication  round  each  agent 
performs  the  following  tasks:  (i)  it  transmits  its  position  and 
receives  its  neighbors’  positions;  (ii)  it  computes  the  average  of 
the  point  set  comprised  of  its  neighbors  and  of  itself.  Between 
communication  rounds,  each  robot  moves  toward  the  average 
point  it  computed. 


We  next  formulate  the  algorithm  using  the  description  model  of  Chap- 
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ter  3.  The  law  is  uniform,  static,  and  data-sampled,  with  standard  message- 
generation  function. 


Robotic  Network:  5disk  with  motion  model  (4.1.1)  in  Rd, 

with  absolute  sensing  of  own  position,  and 
with  communication  range  r 

Distributed  Algorithm:  AVERAGING 
Alphabet:  A  =  M.d  U{null} 

function  msg (p,i) 
i:  return  p 

function  ctl (p,y) 

i:  return  avrg({p}  U{prcvd  |  Prcvd  is  a  non-null  message  in  y})  —  p 


An  implementation  of  this  control  and  communication  law  is  shown  in 
Figure  4.5  for  d  =  1.  Note  that,  along  the  evolution,  (1)  several  robots 
rendezvous ,  i.e. ,  agree  upon  a  common  location,  and  (2)  some  robots  are 
connected  at  the  simulation’s  beginning  and  not  connected  at  the  simula¬ 
tion’s  end.  Our  analysis  of  the  performance  of  this  law  is  contained  in  the 


Figure  4.5  Evolution  of  a  robotic  network  under  the  averaging  control  and  communi¬ 
cation  law  in  Section  4.3.1  implemented  over  the  network  Sdisk,  with  r  =  1.5. 
The  vertical  axis  corresponds  to  the  elapsed  time,  and  the  horizontal  axis  to 
the  positions  of  the  agents  in  the  real  line.  The  51  agents  are  initially  randomly 
deployed  over  the  interval  [—15, 15]. 

following  theorem,  whose  proof  is  postponed  to  Section  4.6.1. 

Theorem  4.15  (Correctness  and  time  complexity  of  averaging  law). 

For  d  =  1,  the  network  5disk,  the  law  CCaveraging  achieves  the  task  Trn(\zvs 

with  time  complexity 

TC(/7)ndzvS;  CC averaging)  tr  0(n5), 

TC(7^ndzvs>  CCaveraging  )  G  O(n). 
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4.3.2  Circumcenter  control  and  communication  laws 

Here  we  define  the  circumcenter  control  and  communication  law  for  the 
network  5disk-  The  law  solves  the  rendezvous  problem  while  maintaining 
the  network  connected.  This  law  was  introduced  by  Ando  et  al.  [1999]  and 
later  studied  by  Lin  et  al.  [2007a],  Cortes  et  al.  [2006]. 

It  is  convenient  to  recall  two  useful  geometric  concepts:  (i)  given  a  bounded 
set  S,  its  circumcenter  CC(S)  is  the  center  of  the  closed  ball  of  minimum 
radius  containing  S  (see  Section  2.1.3);  (ii)  given  a  point  p  in  a  convex  set  Q 
and  a  second  point  q,  the  from-to-inside  map  fti(p,  q,  S )  is  the  point  in  the 
closed  segment  [p,  q]  which  is  at  the  same  time  closest  to  q  and  inside  S  (see 
Section  2.1.1).  Finally,  recall  also  the  connectivity  constraint  set  introduced 
in  Definition  4.4. 

We  loosely  describe  the  circumcenter  law,  that  we  denote  by  CCcircumcenter, 
as  follows: 


[Informal  description]  At  each  communication  round  each  agent 
performs  the  following  tasks:  (i)  it  transmits  its  position  and  re¬ 
ceives  its  neighbors’  positions;  (ii)  it  computes  the  circumcenter 
of  the  point  set  comprised  of  its  neighbors  and  of  itself.  Between 
communication  rounds,  each  robot  moves  toward  this  circum¬ 
center  point  while  maintaining  connectivity  with  its  neighbors 
using  appropriate  connectivity  constraint  sets. 


We  next  formulate  the  algorithm  using  the  description  model  of  Chap¬ 
ter  3.  The  law  is  uniform,  static,  and  data-sampled,  with  standard  message- 
generation  function. 


Robotic  Network:  5disk  with  discrete-time  motion  model  (4.1.1), 
with  absolute  sensing  of  own  position,  and 
with  communication  range  r,  in  M.d 

Distributed  Algorithm:  CIRCUMCENTER 
Alphabet:  A  =  MdU{null} 

function  msg (p,i) 
l:  return  p 

function  ctl (p,y) 

1:  Pgoal  :=  CC({p}U{prcvd  I  for  all  non-null  prcvd  <E  y}) 

2:  A  :=  Adisk(p,  {prcvd  |  for  all  non-null  prcvd  £  y}) 
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3:  return  fti(p,pgoai,  X)  -  p 


This  algorithm  is  illustrated  in  Figure  4.6. 


Figure  4.6  Illustration  of  the  execution  of  CIRCUMCENTER.  Each  row  of  plots  represents 
an  iteration  of  the  law.  At  each  round,  each  agent  computes  its  goal  point 
and  its  constraint  set,  and  then  moves  towards  the  goal  while  remaining  in  the 
constraint  set. 


Remark  4.16  (Relative  sensing  version).  It  is  possible  and  straightfor¬ 
ward  to  implement  the  circumcenter  law  as  a  static  relative-sensing  control 
law  on  the  relative-sensing  network  with  disk  sensors  introduced  in 

Example  3.15 


Relative  Sensing  Network:  5^;sk  with  motion  model  (4.1.2), 
no  communication,  relative  sensing  for  robot  i  given  by: 
robot  measurements  y  contains  S  R(02,r)  for  all  j  i 

Distributed  Algorithm:  RELATIVE-SENSING  CIRCUMCENTER 

function  ctl(y) 

1:  Pgoal  :=  CC({0rf}  U{psnsd  I  for  all  non-null  psnsd  €  y}) 

2:  X  :=  Tdisk(0 d,{Psnsd  \  for  all  non-null  psnsd  G  y}) 

3:  return  fti(0d,pgOai,  AO 


In  the  remainder  of  this  section,  we  generalize  the  circumcenter  law  in  a 
number  of  ways:  (i)  we  modify  the  constraint  set  by  imposing  bounds  on 
the  control  inputs  and  by  relaxing  the  connectivity  constraint  as  much  as 
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possible,  while  maintaining  connectivity  guarantees;  and  (ii)  we  implement 
the  circumcenter  law  on  two  distinct  communication  graphs.  Let  us  note 
that  many  of  these  generalized  circumcenter  laws  can  also  be  implemented 
as  relative-sensing  control  laws;  we  do  not  present  the  details  in  the  interest 
of  brevity. 


4.3.2. 1  Circumcenter  law  with  control  bounds  and  relaxed  connectivity  con¬ 
straints 


First,  we  assume  that  the  agents  have  a  compact  input  space  U  =  B(0d,  umax), 
with  umax  G  M>o-  Additionally,  we  adopt  the  relaxed  Q -connectivity  con¬ 
straint  sets  as  follows.  Let  Q  be  a  proximity  graph  that  is  spatially  dis¬ 
tributed  over  t/disk(?")  and  that  has  the  same  connected  components  as 
£disk (r);  examples  include  £rn  H  £disk(d>  Gg  H  £/disk(V)  and  <7ld (r).  Recall 
the  (/-connectivity  constraint  set  from  Definition  4.7.  Combining  the  relaxed 
connectivity  constraint  and  the  control  magnitude  bound,  we  redefine  the 
control  function  in  the  CIRCUMCENTER  law  to  be: 


function  ctl (p,y) 

%  Includes  control  bound  and  relaxed  Q -connectivity  constraint 
1:  Pgoal  :=  CC(M  U{prcvd  |  for  all  non-null  prcvd  G  y})_ 

2:  X  :=  Xdlsk,g (p,  {prcvd  |  for  all  non-null  prcvd  G  y})  C  B(p,  umax) 
3:  return  fti(p,pgoal,  X)  -p 


Second,  the  circumcenter  law  can  be  implemented  also  on  robotic  net¬ 
works  with  different  proximity  graphs.  For  example,  we  can  implement  the 
circumcenter  algorithm  without  any  change  on  the  following  network. 


4. 3.2.2  Circumcenter  law  on  the  limited  Delaunay  graph 

We  consider  the  same  set  of  physical  agents  as  in  the  5disk-  For  r  G  M>o, 
we  adopt  as  communication  graph  the  r -limited  Delaunay  graph  (/ldM; 
described  in  Section  2.2.  These  data  define  the  uniform  robotic  network 
5ld  =  ({1 , ,n},7Z,  -Eld),  as  described  in  Example  3.4.  On  this  network, 
we  implement  the  CIRCUMCENTER  law  without  any  change,  that  is,  with  the 
same  message-generation  and  control  function  as  we  did  for  the  implemen¬ 
tation  on  the  network  5disk- 
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4. 3.2. 3  Parallel  circumcenter  law  on  the  oo-disk  graph 

We  consider  the  network  5oo-disk  of  first-order  robots  in  Wl  connected  ac¬ 
cording  to  the  ^oo-disk(^)  graph,  see  Example  3.4.  For  this  network  we  define 
the  pll-CRCMCNTR  law,  that  we  denoted  by  CCpll_CRCMcntr,  by  designing  d 
decoupled  circumcenter  laws  running  in  parallel  on  each  coordinate  axis  of 
M.d.  As  before,  this  law  is  uniform  and  static.  What  is  remarkable,  however, 
is  that  no  constraint  is  required  to  maintain  connectivity,  see  Exercise  E4.4. 

The  parallel  circumcenter  of  the  set  S,  denoted  by  PCC(S'),  is  the  center  of 
the  smallest  axis-aligned  rectangle  containing  S.  In  other  words,  PCC(S’)  is 
the  component- wise  circumcenter  of  S,  see  Figure  4.7.  We  state  the  parallel 


Figure  4.7  The  gray-colored  point  is  the  parallel  circumcenter  of  the  collection  of  black- 
colored  points. 


circumcenter  law  as  follows. 


Robotic  Network:  5oo-disk  with  discrete-time  motion  model  (4.1.1)  in  Rd, 
with  absolute  sensing  of  own  position,  and 
with  communication  range  r  in  L°°-metric 

Distributed  Algorithm:  PLL-CRCMCNTR 
Alphabet:  A  =  M.d  U{null} 

function  msg(p,  i) 
i:  return  p 

function  ctl (p,y) 

1:  Pgoal  :=  PCC({p}U{prcvd  I  for  all  non-null  prcvd  G  y}) 

2:  return  pgoal  -  p 
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4.3.3  Correctness  and  complexity  of  circumcenter  laws 

In  this  section  we  characterize  the  convergence  and  complexity  properties 
of  the  circumcenter  law  and  of  its  variations.  The  following  theorem  sum¬ 
marizes  the  results  known  in  the  literature  about  the  asymptotic  proper¬ 
ties  of  the  circumcenter  law.  The  robustness  of  the  circumcenter  control 
and  communication  laws  can  be  characterized  with  respect  to  link  failures; 
see  [Cortes  et  ah,  2006]. 

Theorem  4.17  (Correctness  of  the  circumcenter  laws).  For  d  G  N, 

r  G  M>o  and  £  G  M>o,  the  following  statements  hold: 

(i)  on  the  network  S^isk,  the  law  CC circumcenter  (with  control  magnitude 
bounds  and  relaxed  Q -connectivity  constraints)  achieves  the  exact 
rendezvous  task  %n^zws; 

(ii)  on  the  network  SkD,  the  lawCCc ircumcenter  achieves  the  e-rendezvous 
task  77  rj[(]zvs .  and 

(Hi)  on  the  network  <Soo-disk;  l°/w  CCpll_crcmcntr  achieves  the  exact 
rendezvous  task  %n&zvs- 

Furthermore,  the  evolutions  o/(5diskj  CC  circumcenter);  °/(5ld>CC  CIRCUMCENTER 
and  of  (cSoo-disk,  CCpll_CRCmcntr)  have  the  following  properties: 

(iv)  if  any  two  agents  belong  to  the  same  connected  component  of  the 
respective  communication  graph  at  £  €  Z >o ,  then  they  continue  to 
belong  to  the  same  connected  component  for  all  subsequent  times 
k  >  t;  and 

(v)  for  each  evolution,  there  exists  P*  =  {p\, . . .  ,p*n)  G  (M.d)n  such  that: 

(a)  the  evolution  asymptotically  approaches  P* ,  and 

(b)  for  each  i,j  G  {1, . . .  ,n},  either  p*  =  p*,  or  || p*  —  p*  || 2  >  r 
(for  the  networks  tSdisk  and  Sld)  or  \\p*  —  p*\\oo  >  f  (for  the 
network  Soo-diskj- 

The  proof  of  this  theorem  is  given  in  Section  4.6.2.  Next,  we  analyze 
the  time  complexity  of  CCcircumcenter.  As  we  see  next,  the  complexity  of 
CCCIRCUMCenter  differs  dramatically  when  applied  to  robotic  networks  with 
different  communication  graphs.  We  provide  complete  results  for  the  net¬ 
works  5disk  and  5ld  only  for  the  case  d  =  1. 

Theorem  4.18  (Time  complexity  of  circumcenter  laws).  Forr  G  M>o 
and  e  G  ]0, 1[,  the  following  statements  hold: 
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(i)  on  the  network  iSdisk;  evolving  on  the  real  line  R  (i.e.,  with  d  =  1), 
TC(7^ndZvs, CCCiRcumcenter)  ^  &(n); 

(ii)  on  the  network  5ld>  evolving  on  the  real  line  R  (i.e.,  with  d  =  1), 
TC  ('d'(re)-rndzvs  :  CC  circumcenter)  £  0(?r2  log(ne  1));  and 

(Hi)  on  the  network  5oo_disk,  evolving  on  Euclidean  space  (i.e.,  with  d  £ 
Nj,  TC(7^ndZvs)  CCpll_CrCmcntr)  £  0(n). 

The  proof  of  this  result  is  contained  in  [Martinez  et  al.,  2007b]. 

Remark  4.19  (Analysis  in  higher  dimensions).  The  results  in  The¬ 
orem  4. 18(i)  and  (ii)  induce  lower  bounds  on  the  time  complexity  of  the 
circumcenter  law  in  higher  dimensions.  Indeed,  for  arbitrary  d  >  1,  we  have 

(i)  on  the  network  5disk>  TC(7jndzvs> CC circumcenter)  £  hl(n); 

(ii)  on  the  network  5Ld,  TC(T(r£)_mdzvs,CCciRcuMCENTER)  £  0(n2  log(ne-1)). 

We  have  performed  extensive  numerical  simulations  for  the  case  d  =  2 
and  the  network  5disk-  We  run  the  algorithm  starting  from  generic  ini¬ 
tial  configurations  (where,  in  particular,  robots’  positions  are  not  aligned) 
contained  in  a  bounded  region  of  R2.  We  have  consistently  obtained  that 
the  time  complexity  to  achieve  Trn^zws  with  CCCIrCUMcenter  starting  from 
these  initial  configurations  is  independent  of  the  number  of  robots.  This 
leads  us  to  conjecture  that  initial  configurations  where  all  robots  are  aligned 
(equivalently,  the  1-dimensional  case)  give  rise  to  the  worst  possible  per¬ 
formance  of  the  algorithm.  In  other  words,  we  conjecture  that,  for  d  >  2, 

TC(7j.ndZvs,  CCciRCUMCENTER  )  =  0(n). 

Remark  4.20  (Congestion  effects).  As  discussed  in  Remark  3.8,  one 
way  of  incorporating  congestion  effects  into  the  network  operation  is  to  as¬ 
sume  that  the  parameters  of  the  physical  components  of  the  network  depend 
upon  the  number  of  robots.  For  instance,  by  assuming  that  the  communi¬ 
cation  range  decreases  with  the  number  of  robots.  Theorem  4.18  presents 
an  alternative,  equivalent  way  of  looking  at  congestion:  the  results  hold  un¬ 
der  the  assumption  that  the  communication  range  is  constant,  but  allow  for 
the  diameter  of  the  initial  network  configuration  (the  maximum  inter-agent 
distance)  to  grow  unbounded  with  the  number  of  robots.  • 


4.3.4  Circumcenter  law  in  nonconvex  environments 

In  this  section  we  adapt  the  circumcenter  algorithm  to  work  on  networks 
in  planar  nonconvex  allowable  environments.  Throughout  the  section,  we 

196 


“Distributed  Control  of  Robotic  Networks”  by  F.  Bullo,  J.  Cortes  and  S.  Martinez 
Copyright  ©  2006-2008.  Manuscript  under  contract.  This  version:  October  27,  2008 


DCRN  October  27,  2008 


only  consider  the  case  of  a  compact  allowable  nonconvex  environment  Q 
contracted  into  Qs  for  a  small  positive  5.  We  present  the  algorithm  in 
two  formats:  for  the  communication-based  network  5vis_disk  described  in 
Example  3.6  and  for  the  relative-sensing  network  5^?  disk  described  in  Ex¬ 
ample  3.16. 

We  modify  the  circumcenter  algorithm  in  three  ways:  first,  we  adopt  the 
connectivity  constraints  described  in  the  previous  section  for  range-limited 
line-of-sight  links;  second,  we  further  restrict  the  robot  motion  to  remain 
inside  the  relative  convex  hull  of  the  sensed  robot  positions;  and  third,  we 
move  towards  the  circumcenter  of  the  constraint  set,  instead  of  the  circum¬ 
center  of  the  neighbors  positions.  The  algorithm  details  are  as  follows. 


Robotic  Network:  5vis_disk  with  discrete-time  motion  model  (4.1.1), 
absolute  sensing  of  own  position  and  of  Qs,  and 
communication  range  r  within  line  of  sight  (^vis-disk, Q6 ) 

Distributed  Algorithm:  NONCONVEX  CIRCUMCENTER 
Alphabet:  A  =  M2  U{null} 

function  msg(p,  i ) 
l:  return  p 

function  ctl (p,y) 

1:  A  |  .  'Tvis_disk(.P>  {Prcvd  I  for  dll  HOIl-llllll  prcv(j  £  Ij \  •  Qs) 

2:  Xi  :=  rco({p}  U{prcvd  |  for  all  non-null  prcvd  £  y}]Vi(p;Qs)) 

3:  Pgoal  :=  CC(Ai  n  X2) 

4:  return  fti(p,pgoai,  B(p,  Rmax))  -  p 


[zl 

Next,  we  present  the  relative  sensing  version;  recall  that  p\  =  O2  and  that, 
as  discussed  in  Section  3.2.3  about  the  evolution  of  a  relative  sensing  net¬ 
work  with  environment  sensors,  yenv  denotes  the  environment  measurement 
provided  by  the  range-limited  visibility  sensor. 


Relative  Sensing  Network:  5(ds_disk  with  motion  model  (4.1.2)  in  Qs, 
no  communication,  relative  sensing  for  robot  i  given  by: 
robot  measurements  y  contains  p\^  £  Vidisk(02!  ( Qs)i )  f°r  j/* 
environment  sensing  is  yenv  =  Vidisk(02;  (Qs)i) 

Distributed  Algorithm:  NONCONVEX  RELATIVE-SENSING  CIRCUMCEN¬ 
TER 

function  ctl(y,  yenv) 

l:  X1  :=  A’vis_disk(02,{Psnsd  I  for  all  non- null  psnsd  £  y}-,yenv ) 
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2:  X2  :=  rco({02}U{psnsd  |  for  all  non-null  psnsd  £  y};yenv) 
3:  Pgoal  :=  CC(X1nX2)_ 

4:  return  fti(02,pgoal, -6(02  >  ^max)  ) 


Theorem  4.21  (Correctness  of  the  circumcenter  law  in  nonconvex 
environments).  For  5  >  0,  let  Qs  be  a  contraction  of  a  compact  allowable 
nonconvex  environment  Q.  For  r  E  M>o  and  e  £  M>o,  on  the  network 
vis-disk •  the  law  (/(/nonconvex  circumcenter  (with  control  magnitude  bounds J 
achieves  the  e-rendezvous  task  Te_ rndzvs-  Furthermore,  the  evolution  has  the 
following  properties: 

(i)  if  any  two  agents  belong  to  the  same  connected  component  of  the 
graph  ^vis-disk,Qa  at  £  £  Z>o,  then  they  continue  to  belong  to  the 
same  connected  component  for  all  subsequent  times  k  >  i;  and 

(ii)  there  exists  P*  =  (p\, . . .  ,p*n)  £  Q'f  such  that: 

(a)  the  evolution  asymptotically  approaches  P* ,  and 

(b)  for  each  i,j  £  {1, . . .  ,n},  either  p*  =  p*,  or  p*  and  p*  are 
not  within  range-limited  line  of  sight 

The  proof  of  this  result  can  be  found  in  [Ganguli  et  al.,  20071)].  A  brief 
sketch  of  the  proof  steps  is  presented  in  Section  4.6.4.  The  complexity  of 
nonconvex  circumcenter  law  has  not  been  characterized.  However,  note 
that  the  evolution  from  any  initial  configuration  such  that  Qv\s,Qs  is  complete 
is  also  an  evolution  of  the  circumcenter  law  discussed  in  Section  4.3.2, 
and  hence  Theorem  4. 18(i)  induces  a  lower  bound  on  the  time  complexity. 

4.4  SIMULATION  RESULTS 

In  this  section,  we  illustrate  the  execution  of  the  various  circumcenter  control 
and  communication  laws  introduced  in  this  chapter.  The  circumcenter 
law  is  implemented  on  the  networks  5disk,  ‘Situ  anc^  ‘Soo-disk  in  Mathematica® 
as  a  library  of  routines  and  a  main  program  running  the  simulation.  The 
package  PlanGeom.m  and  SpatialGeom.m  contain  routines  for  the  compu¬ 
tation  of  geometric  objects  in  M2  and  R3,  respectively.  These  routines  are 
freely  available  at  the  book  webpage  http://coordinationbook.info 

We  show  evolutions  of  (5diskj  circumcenter)  in  two  and  three  dimen¬ 
sions  in  Figures  4.8  and  4.9,  respectively. 

Measuring  displacements  in  meters,  we  consider  random  initial  positions 
over  the  square  [—7,  7]  x  [—7,  7]  and  the  cube  [—7,  7]  x  [—7,  7]  x  [—7,  7].  The 
25  robotic  agents  have  a  communication  radius  r  =  4  and  a  compact  input 
space  U  =  B(0 d,umSLX),  with  umax  =  .15.  As  the  simulations  show,  the  task 
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Figure  4.8  Evolution  of  (Sdisk,  CIRCUMCENTER)  with  n  =  25  robots  in  2  dimensions,  (a) 
shows  the  initial  connected  network  configuration  and  (b)  shows  the  evolution 
of  the  individual  agents  until  rendezvous  is  achieved. 


(a)  (b) 

Figure  4.9  Evolution  of  (Sdisk,  CIRCUMCENTER)  with  n  =  25  robots  in  3  dimensions,  (a) 
shows  the  initial  connected  network  configuration  and  (b)  shows  the  evolution 
of  the  individual  agents  until  rendezvous  is  achieved. 
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^mdzvs  is  achieved,  as  guaranteed  by  Theorem  4.17 (i) . 

Within  the  same  setup,  we  also  show  an  evolution  of  (<Sld>  CIRCUMCENTER) 
in  two  dimensions  in  Figure  4.10.  As  the  simulation  shows,  the  task  7^_rndzvs 
is  achieved,  as  guaranteed  by  Theorem  4.17(ii). 


Figure  4.10  Evolution  of  (Sld,  CIRCUMCENTER)  with  n  =  25  robots  in  2  dimensions,  (a) 
shows  the  initial  connected  network  configuration  and  (b)  shows  the  evolution 
of  the  individual  agents  until  rendezvous  is  achieved. 


Finally,  we  show  an  evolution  of  (cSoo.diski  pll-CRCMCNTr)  in  two  dimen¬ 
sions  in  Figure  4.11.  As  the  simulations  show,  the  task  7^n(jzvs  is  achieved, 
as  guaranteed  by  Theorem  4.17 (iii) . 

4.5  NOTES 

The  rendezvous  problem  and  the  circumcenter  algorithm  were  originally 
introduced  by  Ando  et  al.  [1999].  The  circumcenter  algorithm  has  been  ex¬ 
tended  to  other  control  policies,  including  asynchronous  implementations, 
in  [Lin  et  al.,  2007a, b].  The  circumcenter  algorithm  has  been  extended 
beyond  planar  problems  to  arbitrary  dimensions  in  [Cortes  et  al.,  2006], 
where  its  robustness  properties  are  also  characterized.  Regarding  Theo¬ 
rem  4.17,  the  results  on  5disk  appeared  originally  in  [Ando  et  al.,  1999];  the 
results  on  5ld  and  on  cSoo.disk  appeared  originally  in  [Cortes  et  al.,  2006]  and 
in  [Martinez  et  al.,  2007b],  respectively.  Variations  of  the  circumcenter  law 
in  the  presence  of  noise  and  sensor  errors  are  studied  in  [Martinez,  2008]. 
The  continuous-time  version  of  the  circumcenter  law,  with  no  connectivity 
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(a) 


(b) 


Figure  4.11  Evolution  of  (Soo-disk,  pll-CRCMCNTr)  with  n  =  25  robots  in  2  dimensions. 

(a)  shows  the  initial  connected  network  configuration  and  (b)  shows  the  evo¬ 
lution  of  the  individual  agents  until  rendezvous  is  achieved. 


constraints,  is  analyzed  in  [Lin  et  al.,  2007c].  Continuous-time  control  laws 
for  groups  of  robots  with  simple  first-order  dynamics  and  unicycle  dynam¬ 
ics  are  proposed  in  [Lin  et  al.,  2004,  2005,  Dimarogonas  and  Kyriakopoulos, 

2007].  In  these  works,  the  inter-robot  topology  is  time-dependent  and  as¬ 
sumed  a  priori  to  be  connected  at  all  times.  Rendezvous  under  communica¬ 
tion  quantization  is  studied  in  [Fagnani  et  al.,  2004,  Carli  and  Bullo,  2007]. 
Rendezvous  for  unicycle  robots  with  minimal  sensing  capabilities  is  studied 
by  Yu  et  al.  [2008b].  Relationships  with  classic  curve-shortening  flows  are 
studied  by  Smith  et  al.  [2007]. 

Rendezvous  has  also  been  studied  within  the  computer  science  literature, 
where  the  problem  is  referred  to  as  the  “gathering”  or  point  formation  prob¬ 
lem.  Flocchini  et  al.  [1999],  Suzuki  and  Yamashita  [1999]  study  the  point 
formation  problem  under  the  assumption  that  each  robot  is  capable  of  sens¬ 
ing  all  other  robots.  Flocchini  et  al.  [2005]  propose  asynchronous  algorithms 
to  solve  the  gathering  problem,  and  Agmon  and  Peleg  [2006]  study  the  solv¬ 
ability  of  the  problem  in  the  presence  of  faulty  robots. 

Multirobot  rendezvous  with  line-of-sight  sensors  is  considered  in  [Roy  and  Dudek, 
2001],  where  solutions  are  proposed  based  on  the  exploration  of  the  un¬ 
known  environment  and  the  selection  of  appropriate  rendezvous  points  at 
pre-specified  times.  The  paper  [Hayes  et  al.,  2003]  also  consider  rendezvous 
at  a  specified  location  for  visually-guided  agents,  but  the  proposed  solution 
requires  each  agent  to  have  knowledge  of  the  location  of  all  other  agents. 
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The  problem  of  computing  a  multirobot  rendezvous  point  in  polyhedral  sur¬ 
faces  made  of  triangular  faces  is  considered  in  [Lanthier  et  ah,  2005].  The 
perimeter  minimizing  algorithm  presented  by  Ganguli  et  al.  [2007b]  solves 
the  rendezvous  problem  for  sensor-based  networks  with  line-of-sight  range- 
limited  sensors  in  nonconvex  environments. 

Regarding  the  connectivity  maintenance  problem,  a  number  of  works  have 
addressed  the  problem  of  designing  a  coordination  algorithm  that  achieves 
a  general,  non-specified  task  while  preserving  connectivity.  The  centralized 
solution  proposed  in  [Zavlanos  and  Pappas,  2005]  allows  for  a  general  range 
of  agent  motions.  The  distributed  solution  presented  by  [Savla  et  ah,  2007b] 
gives  connectivity  maintaining  constraints  for  second-order  control  systems 
with  input  magnitude  bounds.  A  distributed  algorithm  to  perform  graph  re¬ 
arrangements  that  preserve  the  connectivity  is  presented  in  [Schuresko  and  Cortes, 
2007].  Connectivity  problems  have  been  studied  also  in  other  contexts. 
Langbort  and  Gupta  [2007]  study  the  impact  of  the  connectivity  of  the  inter¬ 
connection  topology  in  a  class  of  network  optimization  problems.  Spanos  and  Murr 
[2005]  generate  connectivity-preserving  motions  between  pairs  of  formations. 

.Ji  and  Egerstedt  [2007]  design  Laplacian-based  control  laws  to  solve  forma¬ 
tion  control  problems  while  preserving  connectivity.  Various  works  have 
focused  on  designing  the  network  motion  so  that  some  desired  measure  of 
connectivity  (e.g.,  algebraic  connectivity)  is  maximized  under  position  con¬ 
straints.  Boyd  [2006],  de  Gennaro  and  Jadbabaie  [2006]  consider  convex 
constraints,  while  Kim  and  Mesbahi  [2006]  deal  with  a  class  of  nonconvex 
constraints.  Zavlanos  and  Pappas  [2007b]  use  potential  fields  to  maximize 
algebraic  connectivity. 

A  continuous-time  version  of  the  averaging  control  and  communication  law 
is  also  known  as  the  Hegselmann-Krause  model  for  “opinion  dynamics  under 
bounded  confidence;”  see  [Hegselmann  and  Krause,  2002,  Lorenz,  2007].  In 
this  model,  each  agent  may  change  its  opinion  by  averaging  it  with  that  of 
neighbors’  who  are  in  an  e-confidence  area.  In  other  words,  the  difference 
between  the  agent  opinion  and  its  neighbors’  should  be  bounded  by  e.  A 
similar  model  where  the  communication  between  agents  is  random  is  the 
Deffuant-Weisbuch  model,  inspired  by  a  model  of  dissemination  of  culture; 
see  [Deffuant  et  al.,  2000,  Axelrod,  1997]. 


4.6  PROOFS 

This  section  gathers  the  proofs  of  the  main  results  presented  in  the  chapter. 
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4.6.1  Proof  of  Theorem  4.15 

Proof.  One  can  easily  prove  that,  along  the  evolution  of  the  network,  the 
ordering  of  the  agents  is  preserved,  i.e. ,  the  inequality  pM  <  p ^  is  preserved 
at  the  next  time  step.  However,  links  between  agents  are  not  necessarily 
preserved  (e.g.,  see  Figure  4.8).  Indeed,  connected  components  may  split 
along  the  evolution.  However,  merging  events  do  not  occur.  Consider  two 
contiguous  connected  components  C\  and  C2  of  I/diskM)  with  C\  to  the  left 
of  C2.  By  definition,  the  rightmost  agent  in  the  component  C\  and  the 
leftmost  agent  in  the  component  C2  are  at  a  distance  strictly  larger  than  r. 
Now,  by  executing  the  algorithm,  they  can  only  but  increase  that  distance, 
since  the  rightmost  agent  in  C\  will  move  to  the  left,  and  the  leftmost  agent 
in  C*2  will  move  to  the  right.  Therefore,  connected  components  do  not  merge. 

Consider  first  the  case  of  an  initial  configuration  of  the  network  for  which 
the  communication  graph  remains  connected  throughout  the  evolution.  With¬ 
out  loss  of  generality,  assume  that  the  agents  are  ordered  from  left  to  right  ac¬ 
cording  to  their  identifier,  that  is,  pM(0)  <  •  •  •  <  pM(0).  Let  a  £  {3, . . . ,  n} 
have  the  property  that  agents  {2, . . . ,  a  —  1}  are  neighbors  of  agent  1,  and 
agent  a  is  not.  (If  instead  all  agents  are  within  an  interval  of  length  r,  then 
rendezvous  is  achieved  in  1  time  instant,  and  the  statement  in  theorem  is 
easily  seen  to  be  true.)  Note  that  we  can  assume  that  agents  {2, . . . ,  a  —  1} 
are  also  neighbors  of  agent  a.  If  this  is  not  the  case,  then  those  agents  that 
are  neighbors  of  agent  1  and  not  of  agent  a,  rendezvous  with  agent  1  at  the 
next  time  instant.  At  the  time  instant  t  =  1,  the  new  updated  positions 
satisfy 


a— 1 


i>|1|(i)  =  ^Ep|‘1(0)- 


k= 1 


P[7](  1)6  \~itp[k]( 0),*j,  7e{2,...,a-l}, 
La  J 


k= 1 


where  *  denotes  a  certain  unimportant  point. 
Now,  we  show  that 


p[1](a  -  1) -p[1](0)  >  — (4.6.1) 
a(a  —  1) 

Let  us  first  show  the  inequality  for  a  =  3.  Because  of  the  assumption  that 
the  communication  graph  remains  connected,  agent  2  is  still  a  neighbor  of 
agent  1  at  the  time  instant  t  =  1.  Therefore,  pW(2)  >  |(pW(l)  +pl2l(l)), 
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and  from  here  we  deduce 

P[1](2)-p[1](0)>^(p[2](l)-p[1](0)) 

>  ^ (| (p[1] (0)  +  p[2] (0)  +  p[3] (0))  -pW(0))  >  i(p[3](0)_p[i](0))  > 

Let  us  now  proceed  by  induction.  Assume  that  inequality  (4.6.1)  is  valid 
for  a  —  1,  and  let  us  prove  it  for  a.  Consider  first  the  possibility  when  at 
the  time  instant  l  =  1,  the  agent  a  —  1  is  still  a  neighbor  of  agent  1.  In  this 
case,  pW(2)  >  -R  J2k=i  p[fc]  (1) ,  and  from  here  we  deduce 

pW(2)  -pW(0)  >  1)  -pM(0))  >  ^(^P[fcl(0)  -P[1](0)) 

a  k= l 

>  /  1  n  (pM(0)  -p[1](0))  >  7  r  lV 

a(ce  —  1)  V  /  a  (a  —  1) 

which  in  particular  implies  (4.6.1).  Consider  then  the  case  when  agent  a—1 
is  not  a  neighbor  of  agent  1  at  the  time  instant  i  =  1.  Let  /3  <  a  such  that 
agent  (3  —  1  is  a  neighbor  of  agent  1  at  l  =  1,  but  agent  /3  is  not.  Since  (3  <  a, 
we  have  by  induction  pW(/3)  —  pM(l)  >  W-i)-  From  here,  we  deduce  that 
pW(a-  1)  -pN(0)  > 

It  is  clear  that  after  l\  =  a— 1,  we  could  again  consider  two  complementary 
cases  (either  agent  1  has  all  others  as  neighbors  or  not)  and  repeat  the  same 
argument  once  again.  In  that  way,  we  would  find  I2  such  that  the  distance 
traveled  by  agent  1  after  1 2  rounds  would  be  lower  bounded  by  n^r_^  ■ 
Repeating  this  argument  iteratively,  the  worst  possible  case  is  one  in  which 
agent  1  keeps  moving  to  the  right  and,  at  each  time  step,  there  is  always 
another  agent  which  is  not  a  neighbor.  Since  the  diameter  of  the  initial 
condition  Po  is  upper  bounded  by  (n  —  1  )r,  in  the  worst  possible  situation, 
there  exists  some  time  Ik  such  that  =  0(r(n—  1)).  This  implies  that 

k  =  0((n  —  l)2n).  Now  we  can  upper  bound  the  total  convergence  time  Ik 
by  =  Y2i=i  ai  ~  k  —  —  1)5  where  we  have  used  that  <  n  for  all 

i  S  {1, . . .  ,n}.  From  here  we  see  that  Ik  =  0((n  —  l)3n)  and  hence,  we 
deduce  that  in  0(n(n  —  l)3)  time  instants  there  cannot  be  any  agent  which 
is  not  a  neighbor  of  the  agent  1.  Hence,  all  agents  rendezvous  at  the  next 
time  instant.  Consequently, 

T C  ( "TrTldzvg  ?  AVERAGIN G  ;  P0)  =  O(n(n-l)3). 

Finally,  for  a  general  initial  configuration  Pq,  because  there  are  a  finite 
number  of  agents,  only  a  finite  number  of  splittings  (at  most  n  —  1)  of  the 
connected  components  of  the  communication  graph  can  take  place  along  the 
evolution.  Therefore,  we  conclude  TC(7).11(jzva, CCAVEraging)  =  0(n5). 
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Let  us  now  prove  the  lower  bound.  Consider  an  initial  configuration  Pq  £ 
Mn  where  all  agents  are  positioned  in  increasing  order  according  to  their 
identity,  and  exactly  at  a  distance  r  apart,  say  p^+1l(0)  —  p^(0)  =  r,  i  E 
{1, . . . ,  n  —  1}.  Assume  for  simplicity  that  n  is  odd  -  when  n  is  even,  one  can 
reason  in  an  analogous  way.  Because  of  the  symmetry  of  the  initial  condition, 
in  the  first  time  step,  only  agents  1  and  n  move.  All  the  remaining  agents 
remain  in  their  position  because  it  coincides  with  the  average  of  its  neighbors’ 
position  and  its  own.  At  the  second  time  step,  only  agents  1,  2,  n  —  1  and  n 
move,  and  the  others  remain  static  because  of  the  symmetry.  Applying  this 
idea  iteratively,  one  deduces  that  the  time  step  when  agents  and 
move  for  the  first  time  is  lower  bounded  by  Since  both  agents  have 

still  at  least  a  neighbor  (agent  ZL^),  the  task  Tm dzvs  has  not  been  achieved 
yet  at  this  time  step.  Therefore,  TC(7^ndzvs,  CCAVEraging>  Pq)  >  rk2^,  and  the 
result  follows.  ■ 


4.6.2  Proof  of  Theorem  4.17 

Proof.  We  divide  the  proof  of  the  theorem  into  three  groups,  one  per  net¬ 
work. 

STEP  1:  Facts  on  (5disk)  CCcircumcenter).  Fact  (iv)  for  (5disk)  CCcircumcbnter 
is  a  direct  consequence  of  the  control  function  definition  of  the  CIRCUMCEN- 
ter  law  and  Lemma  4.8. 

Let  us  show  fact  (i).  Because  Q  has  the  same  connected  components 
as  UdiskM,  fact  (iy)  implies  that  the  number  of  connected  components  of 
^disk(^)  can  only  but  decrease.  In  other  words,  the  number  of  agents  in 
each  of  the  connected  components  of  (?disk(0  is  non-decreasing.  Since  there 
is  a  finite  number  of  agents,  there  must  exist  Iq  such  that  the  identity  of 
the  agents  in  each  connected  component  of  £/disk(f)  is  fixed  for  all  t  >  Iq 
(i.e. ,  no  more  agents  are  added  to  the  connected  component  afterwards). 
In  what  follows,  without  loss  of  generality,  we  assume  that  there  is  only 
one  connected  component  after  £q,  i.e.,  the  graph  is  connected  (if  this  is 
not  the  case,  then  the  same  argument  follows  through  for  each  connected 
component). 

Our  strategy  to  prove  that  the  law  CCCIRCUMcenter  (with  control  mag¬ 
nitude  bounds  and  relaxed  ^-connectivity  constraints)  achieves  the  exact 
rendezvous  task  7(ncizvs  consists  of  two  steps. 

(a)  We  first  define  a  set- valued  dynamical  system  ((Rd)n,  (Rd)n,  T)  such 
that  the  evolutions  of  (5disk)  CCcircumcenter)  starting  from  an  initial 
configuration  where  I?disk(r)  is  connected  are  contained  in  the  set  of 
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evolutions  of  the  set-valued  dynamical  system;  and 

(b)  we  then  establish  that  any  evolution  of  ((Rd)n,  ( M.d)n,T )  converges 
to  a  point  in  diag((Md)n)  (the  point  might  be  different  for  different 
evolutions) . 

This  strategy  is  analogous  to  the  discussion  regarding  the  Overapproxima- 
tion  Lemma  for  time-dependent  systems  in  Section  1.3.5. 

Let  as  perform  (a).  Given  a  connected  graph  G  with  vertices  {1, . . . ,  n},  let 
us  consider  the  constraint  sets  and  goal  points  defined  with  respect  to  G.  In 
other  words,  given  P  =  {p\, . . .  ,pn )  G  ( Wl)n ,  define  for  each  i  G  {1, . . . ,  n}, 

(.Pgoal ) i  ■=  CC({pi}  u{pj  |  j  G  NC(i)}), 

Xi  :=  Pi  r4p)  I  j  e  nB(pi,  tw), 

where  rj(P)  =  max{r,  max{||pj  —  pj  || 2  |  j  G  A fait)}}-  Since  two  neighbors 
according  to  G  can  be  arbitrarily  far  from  each  other  in  M.d,  we  need  to 
modify  the  definition  of  the  constraint  set  with  the  radius  rj{P)  to  prevent 
Xj  from  becoming  empty.  Note  that  if  \\pi  —  py  1 1 2  <  f  for  all  j  G  J\fc(i),  then 
ri(P)  =  r,  and  therefore,  Xj  =  X(\\s\^  g('Pi-,  P)  D  B(pti  umax).  It  is  also  worth 
observing  that  both  (pg0ai)i  and  Xj  change  continuously  with  (p\ , . . .  ,pn). 

Define  the  map  itic  :  (Md)n  — ►  (Md)n  by 

ftiG(pi,  .  .  .  ,Pn)  =  (fti(pi,  (Pgoal)l,  A'l),  .  .  .  ,fti(pn,  (Pgoal)n,  Xn)). 

One  can  think  of  ftiG  as  a  circumcenter  law  where  the  neighboring  relation¬ 
ships  among  the  agents  never  change.  Because  fti  is  continuous,  and  (pgoai)i 
and  Xj,  i  G  {l,...,n},  change  continuously  with  (pi, . . .  ,pn),  we  deduce 
that  ftiG  is  continuous. 

We  are  now  ready  to  define  a  set-valued  dynamical  system  ((Mrf)n,  (Md)n,  T) 
through  the  set-valued  map  T  :  (Md)n  (Md)n  given  by 

T(pi, . . . , Pn)  =  {ftiG^i;  •  •  •  iPn)  \  G  is  a  strongly  connected  digraph}. 

Note  that  the  evolution  of  the  circumcenter  law  using  a  proximity  graph 
such  as  I/diskM  is  just  one  of  the  multiple  evolutions  described  by  this  set¬ 
valued  map.  This  concludes  (a). 

Let  us  now  perform  (b).  To  characterize  the  convergence  properties  of 
the  set-valued  dynamical  system,  we  use  the  LaSalle  Invariance  Principle  in 
Theorem  1.18.  With  the  notation  of  this  result,  we  select  W  =  (Mrf)n.  This 
set  is  clearly  strongly  positively  invariant  for  ((Mci)n,  ( M.d)n,T ). 

Closedness  of  the  set-valued  map.  Since  ftiG  is  continuous  for  each  di¬ 
graph  G  and  there  is  a  finite  number  of  strongly  connected  digraphs  on  the 
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vertices  {1, . . .  ,  n},  Exercise  El. 8  implies  that  T  is  closed. 

Common  Lyapunov  function.  Define  the  function  Vdiam  :  (Rrf)n  — >  R>o 
by 

Vdiam ( [P)  =  max{||p?;  -Pj II  I  i,  j  G  {1,. . .  ,n}}. 

With  a  slight  abuse  of  notation,  we  denote  by  co (P)  the  convex  hull  of 
{pi , . . .  ,pn}  C  Rrf.  Note  that  Eiiam (P)  =  diam(co(P)).  The  function  Ediam 
has  the  following  properties. 

(i)  Eiiam  is  continuous  and  invariant  under  permutations  of  its  argu¬ 
ments; 

(ii)  Eiiam ( P)  =  0  if  and  only  if  P  E  diag((Rd)n),  where  recall  that 
diag((Rd)?l)  =  {(pi ,...,pn)  €  ( Rd)n  |  pM  =  ■••  =  pM  E  Rd}  de¬ 
notes  the  diagonal  set  of  (M.d)n.  This  fact  is  an  immediate  conse¬ 
quence  of  the  fact  that,  given  a  set  S  C  (Rd)n,  diam(co(5))  =  0  if 
and  only  if  S'  is  a  singleton;  and 

(iii)  Vdiam  is  non-increasing  along  T  on  (R^)"-.  Consider  a  finite  set 
of  points  S  E  F((Rrf)n)  and  let  CC(S’)  be  its  circumcenter.  From 
Lemma  2.2(i),  we  have  CC(S')  E  co(S).  Therefore,  for  any  strongly 
connected  digraph  G,  we  have  that  co(ftic(P))  C  co (P)  for  any 
P  E  (Rd)n.  Since  for  any  two  sets  Si,  £2  C  (Rd)n  such  that  co(S2)  C 
co(S2)  it  holds  that  Ediam(S'2)  <  Ediam(Si),  then  Ediam(ftiG(P))  < 
Ediam(-P)  for  any  strongly  connected  digraph  G,  which  implies  that 
Vdiam  is  non-increasing  along  T  on  (Rd)n. 

Bounded  evolutions.  Consider  any  initial  condition  (pi(0), . . .  ,pn(0))  E 
(Rd)n.  For  any  strongly  connected  digraph,  G,  we  have 

ftiG(pi(f), . . .  ,pn{£))  G  co(pi(0), . . .  ,Pn(0)), 

for  all  l  E  Z>o-  Therefore,  any  evolution  of  the  set-valued  dynamical  system 
((Rrf)n,  (R d)n,T)  is  bounded. 

Characterization  of  the  invariant  set.  By  the  LaSalle  Invariance  for  set¬ 
valued  dynamical  systems  in  Theorem  1.18,  any  evolution  with  initial  con¬ 
dition  in  W  =  (Rrf)n  approaches  the  largest  weakly  positively  invariant  set 
M  contained  in 

{P  E  (R d)n  I  3P'  E  T(P)  such  that  E^P')  =  Kiiam(P)}. 

We  show  that  M  =  diag((Rrf)n).  Clearly,  diag((Rrf)n)  C  M.  To  prove  the 
other  inclusion,  we  reason  by  contradiction.  Assume  P  E  M  \  diag((Rd)Tl), 
and  therefore,  Ediam(P)  >  0.  Let  G  be  a  strongly  connected  digraph  and 
consider  ftic(P).  For  each  i  E  {1, . . .  ,n},  we  distinguish  two  cases  depend¬ 
ing  on  whether  pt  is  or  is  not  a  vertex  of  co (P).  If  pi  0  Ve(co(P)),  then 
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Lemma  2.2(i)  implies  that  fti(pj,  (pg0ai)i>  %i)  G  co (P)  \  Ve(co (P)). 

If  pi  G  Ve(co(P)),  then  we  must  take  into  consideration  the  possibility  of 
having  more  than  one  agent  located  at  the  same  point.  If  the  location  of 
all  the  neighbors  of  i  in  the  digraph  G  coincides  with  pi ,  then  agent  i  will 
not  move,  and  hence  fti (pi,  (pgoai)i,  %i)  G  Ve(co(P)).  However,  we  can  show 
that  the  application  of  fti^  strictly  decreases  the  number  of  agents  located 
at  pi .  Let  us  denote  this  number  by  Ni,  i.e., 

Ni  =  I ii  G  {1,  ■  •  •  ,n}  I  Pj  =  Pi  and  pj  G  {pi, . . .  ,pn}}\- 

Since  the  digraph  G  is  strongly  connected,  there  must  exist  at  least  an 
agent  located  at  pi  with  a  neighbor  which  is  not  located  at  pi  (otherwise, 
all  agents  would  be  at  pi,  which  is  a  contradiction).  In  other  words,  there 
exist  'i* , j  G  {l,...,n}  such  that  plit  =  pi,  pj  /  pi,  and  j  G  A /g(**)-  By 
Lemma  2.2(i),  we  have  that  (pgoai)i„  G  co (P)  \  Ve(co(P)),  and  therefore, 
(Pgoal)i,  /  Pi,  ■  Combining  this  with  the  fact  that 

{Pi}u{pj  |  j  g  A/g(*)}  c  B(pit,n,(P)), 

we  can  apply  Lemma  2.2(ii)  to  ensure  that  \ptf ,  (pgoai)n[  has  nonempty  in¬ 
tersection  with  .  Therefore,  fti(pi„ ,  (pgoai)i,,  %i,)  G  co(P)  \  Ve(co(P)), 
and  the  number  Ni  of  agents  located  at  pi  decreases  at  least  by  one  with 
the  application  of  ftic- 

Next,  we  show  that,  after  a  finite  number  of  steps,  no  agents  remains  at 
the  location  pi.  Define  N  =  max{Nj  |  pi  G  Ve(co(P))}  <  n  —  1.  Then 
all  agents  in  the  configuration  ftiG1(ftic2(. . .  ftiGJV(P)))  are  contained  in 
co(P)  \  Ve(co(P)),  for  any  collection  of  strongly  connected  directed  graphs 
G\, . . .  ,Gn-  Therefore,  diam(co(fti(31(ftiG2(-  ■ .  ftic?JV(P)))))  <  diam(co(P)), 
which  contradicts  the  fact  that  M  is  weakly  invariant. 

Point  convergence.  We  have  proved  that  any  evolution  of  (( Wl)n ,  (Md)ri,  T) 
approaches  the  set  diag((M<i)Tl).  To  conclude  the  proof,  let  us  show  that  the 
convergence  of  each  trajectory  is  to  a  point,  rather  that  to  the  diagonal  set. 
Let  {P(t)  |  i  G  Z>o }  be  an  evolution  of  the  set-valued  dynamical  system. 
Since  the  sequence  is  contained  in  the  compact  set  co(P(0)),  there  exists  a 
convergent  subsequence  (P(£fc)  |  k  G  Z>o},  i.e.,  there  exists  p  G  M.d  such 
that 

lim  P(4)  =  (p, . . .  ,p).  (4.6.2) 

K— ►+ OO 

Let  us  show  that  the  whole  sequence  {P(£)  \  £  G  Z>o }  converges  to  (p, . . .  ,p). 
Because  of  (4.6.2),  for  any  e  >  0,  there  exists  ko  such  that  for  k>k$  one  has 
co(P(4))  C  B{p,e/ y/n).  From  this,  we  deduce  that  co (P(£))  C  B(p,e/y/n) 
for  all  £  >  40>  which  in  turn  implies  that  || P(£)  —  (p, ...  ,p) || 2  <  e  for  all 
f  >40l  as  claimed.  This  concludes  (b). 
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(a)  and  (b)  imply  that  any  evolution  of  (Sdisk,  CCcircumcenter)  starting 
from  an  initial  configuration  where  f/disk(r)  is  connected  converges  to  a  point 
in  diag((Md)n).  To  conclude  the  proof  of  fact  (i),  we  only  need  to  establish 
that  this  convergence  is  in  finite  time.  This  last  fact  is  a  consequence  of 
Exercise  E4.5. 

Fact  (v)  for  (5disk>  CCcircumcenter)  is  a  consequence  of  facts  (i)  and  (iv). 

STEP  2:  Facts  on  (<SLd, CCCIRCUMcenter)-  The  proof  of  facts  (i),  (iv), 
and  (v)  for  (5ld,  CCoircumcenter)  is  analogous  to  the  proof  of  these  facts  for 
(‘Sdisk;  CCcircumcenter),  and  we  leave  it  to  the  reader. 

STEP  3:  Facts  on  (5oo-disk,  CCpll_crcmcntr).  From  the  expression  for 
the  control  function  of  CCpll_crcmcntr,  we  deduce  that  the  evolution  un¬ 
der  CCpll_crcmcntr  of  the  robotic  network  5oo-diSk  (in  d  dimensions)  can  be 
alternatively  described  as  the  evolution  under  CCCIRCUMcenter  of  d  robotic 
networks  5disk  in  M,  see  Exercise  E4.4.  Therefore,  facts  (i),  (iv),  and  (v)  for 
(^cxD-disk;  CCpll-crcmcntr)  follow  from  facts  (i),  (iv),  and  (v)  for  (5disk,CCCIRCUMCENTER 


4.6.3  Proof  of  Theorem  4.18 

Proof.  Let  Po  =  (p^(0), . . .  ,p^(0))  E  Kn  denote  the  initial  condition. 

Fact  (i).  For  d  =  1,  the  connectivity  constraints  on  each  agent  i  E  {1, . . . ,  n} 
imposed  by  the  constraint  set 

Tdisk(pW,  {Prcvd  |  for  all  non-null  prcvd  E  y W})  (4.6.3) 

are  superfluous.  In  other  words,  the  goal  configuration  resulting  from  the 
evaluation  by  agent  i  of  the  control  function  of  the  circumcenter  law 
belongs  to  the  constraint  set  in  (4.6.3).  Moreover,  the  order  of  the  robots 
on  the  real  line  is  preserved  from  one  time  step  to  the  next  one.  Both 
observations  are  a  consequence  of  Exercise  E4.3. 

Let  us  first  establish  the  upper  bound  in  fact  (i).  Consider  the  case  when 
f?disk(r)  is  connected  at  Po-  Without  loss  of  generality,  assume  that  the 
agents  are  ordered  from  left  to  right  according  to  their  identifier,  that  is, 
pM(0)  <  <  p[nl(0).  Let  a  E  {3,...,n}  have  the  property  that  agents 

{2, . . .  ,a  —  1}  are  neighbors  of  agent  1,  and  agent  a  is  not.  (If  instead  all 
agents  are  within  an  interval  of  length  r,  then  rendezvous  is  achieved  after 
one  time  step,  and  the  upper  bound  in  fact  (i)  is  easily  seen  to  be  true.) 
Figure  4.12  presents  an  illustration  of  the  definition  of  a.  Note  that  we  can 
assume  that  agents  {2, . . . ,  a  —  1}  are  also  neighbors  of  agent  a.  If  this  is  not 
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the  case,  then  those  agents  that  are  neighbors  of  agent  1  and  not  of  agent 
a,  rendezvous  with  agent  1  after  one  time  step.  At  the  time  instant  1=1. 
the  new  updated  positions  satisfy 


P[1](  1)  = 


pW(0)  +  p[a_1](0) 


pW(  1)  G 


pM  (0)  +  p^  (0)  pW  (0)  +  pW  (0)  +  r 


for  7  E  {2,...,  a  —  1}.  These  equalities  imply  that  pW(l)  —  pW(0)  = 
i('p[«-1](0)  _  pi1]  (0))  <  7 \r.  Analogously,  we  deduce  pi1!  (2)  —  pW(l)  <  |r, 
and  therefore, 


pW  (2)  —  pM  (0)  <  r. 


(4.6.4) 


On  the  other  hand,  from  pW  (2)  E  (pW  (1)  +  pl“  11  (1)) ,  *]  (where  the  sym¬ 
bol  *  represents  a  certain  unimportant  point  in  M),  we  deduce 

pm(2)  -pl'l(O)  >  i(p[‘](l)+pl”-‘l(l))  -p['](0) 

i^-pW(O)) 

=  )  (pW  (0)  -  P[1]  (o))  >  ir .  (4.6.5) 


>)(PI^1|(1)-P|1|(0))> 


Inequalities  (4.6.4)  and  (4.6.5)  mean  that,  after  at  most  two  time  steps, 
agent  1  has  traveled  an  amount  larger  than  r / 4.  In  turn  this  implies  that 

1  4 

—  diam(co(.Po))  <  TC(7(.ndzvs,CCCnlcUMCENTBR, Pq)  <  —  diam(co(Po))- 
r  r 


If  I/disk (r)  is  not  connected  at  Pq,  note  that  along  the  network  evolution, 
the  connected  components  of  the  r-disk  graph  do  not  change.  Using  the 
previous  characterization  on  the  amount  traveled  by  the  leftmost  agent  of 
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each  connected  component  in  at  most  two  time  steps,  we  deduce 


TC(7^ndzvS)  CCcircumcenterj  Po)  <  —  max  diam(co(C)), 

T  CgC(Pq) 


where  C(Po)  denotes  the  collection  of  connected  components  of  (?disk(0  at  Po- 
The  connectedness  of  each  C  €  C(Pq)  implies  that  diam(co(C))  <  (n  —  l)r, 
and  therefore,  TC(7;ndzvs,  CCCiRCumcenter)  €  0(n). 

The  lower  bound  in  fact  (i)  is  established  by  considering  Po  £  Kn  such  that 
p[*+1l (0)  —  pW(0)  =  r,  i  £  {1, . . .  ,n  —  1}.  For  this  configuration,  we  have 
diam(co(P0))  =  (n  -  1  )r,  and  therefore,  TC(7;ndzvs,  CCCircumcenter,  Po)  > 
n  —  1. 

Fact  (ii).  In  the  r-limited  Delaunay  graph,  two  agents  on  the  line  that  are 
at  most  at  a  distance  r  from  each  other  are  neighbors  if  and  only  if  there 
are  no  other  agents  between  them.  Also,  note  that  the  r-limited  Delaunay 
graph  and  the  r-disk  graph  have  the  same  connected  components,  cf.  The¬ 
orem  2.9.  A  similar  argument  to  the  one  used  in  the  proof  of  fact  (i)  above 
guarantees  that  the  connectivity  constraints  imposed  by  the  constraint  sets 
Adisk(pW,  {prcvd  |  for  all  non-null  prcvd  £  yW})  are  again  superfluous. 

Consider  first  the  case  when  Ghui1")  is  connected  at  Po-  Note  that  this 
is  equivalent  to  ^disk(^)  being  connected  at  Po.  Without  loss  of  generality, 
assume  that  the  agents  are  ordered  from  left  to  right  according  to  their 
identifier,  that  is,  pM(0)  <  •  <  piTL\ 0).  The  evolution  of  the  network 

under  CCcircumcentbr  can  then  be  described  as  the  discrete-time  dynamical 
system 


p[l\i  +  l)  =  ^(pW(£)+pW(e)), 
p[2](i  +  l)  =  ^(p[1]{£)  +p[3](£)), 

p[n-l]{£+1)  =  ^p[n-2]{£)+p[n]m 
P[n]{£+1)  =  l(p[n~1](£)+P[n](£)). 


Note  that  this  evolution  respects  the  ordering  of  the  agents.  Equivalently, 
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we  can  write  P{£  +  1)  =  AP(£),  where  A  G  M.nxn  is  the  matrix  given  by 


A  = 


0 


0 

0 

0 


0 

0 


1 

I 

2. 


Note  that  A  =  ATYid+  (^,0),  as  defined  in  Section  1.6.4.  Theorem  1.77(i) 
implies  that,  for  Pave  =  ll^Po,  we  have  that  lim ^+00P{£)  =  Payeln,  and 
that  the  maximum  time  required  for  || P(£)  —  Paveln||2  <  r?||Po  ~  Paveln||2 
(over  all  initial  conditions  in  R")  is  0(n2  logr/-1).  (Note  that  this  also 
implies  that  agents  rendezvous  at  the  location  given  by  the  average  of  their 
initial  positions.  In  other  words,  the  asymptotic  rendezvous  position  for  this 
case  can  be  expressed  in  closed  form,  as  opposed  to  the  case  with  the  r-disk 
graph.) 


Next,  let  us  convert  the  contraction  inequality  on  2-norms  into  an  appro¬ 
priate  inequality  on  oo-norms.  Note  that  diam(co(Po))  <  (n  —  1  )r  because 
C?ld {r)  is  connected  at  Po-  Therefore, 

|| Po  -  Pave  1 1 1 oo  =  max  |pW(0)  -  Pave|  <  |p[1]  (0)  -  pW(0)|  <  (n  -  1  )r. 
ie{l,...,n} 

For  £  of  order  n2  log  77  ,  we  use  this  bound  on  ||Pq  —  Pavel||oo  and  the  basic 

inequalities  ||w||oc  <  ||u||2  <  y^lMloo  for  all  v  G  Mn,  to  obtain 

||  P{£)  -  Pave  1 1 1  oo  <  ||P(0  -  Pave  1 1 1 2  <  VW^O  ~  -Favellh 
<  T]y/n\\Po  -  Pavel  Hoc  <  r)sfn{n  -  1  )r. 

This  means  that  (re)-rendezvous  is  achieved  for  rjy/n(n  —  1  )r  =  re,  that  is, 
in  time  0(n2  logrp1)  =  0(n2  log(ne-1)). 


Next,  we  show  the  lower  bound.  Consider  the  unit-length  eigenvector 
vn  =  yj 7TT(sin  npL  >  •  •  ■  >  s^n  )T  e  of  Tridn_i(l,0, 1)  corresponding 
to  the  largest  singular  value  cos(^).  For  fi  =  io^rn5/2,  we  then  define  the 
initial  condition 


Po  =  Vp+ 


g  r. 


One  can  show  that  pM(0)  <  p[*+1](0)  for  i  G  {1, . . .  ,n  —  1},  that  Pave  =  0, 
and  that  max{p[!+1l  (0)  —  pW(0)  |  i  G  {1, . . .  ,  n—  1}}  <  r.  Using  Lemma  1.79 
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and  because  ||ic||oo  <  ll^lb  <  v^HHIoo  for  all  w  G  Mn,  we  compute 


P0  1 1  oo  — 


rn 


5/2 


10^2 


Pi 


0 

Vn-1 


> 


rn 


10V2 


P- 


0 

v„_i 


rn  rn 

-  Io7i  Vn_1 2  =  ioTl' 

The  trajectory  P(£)  =  (cos(^))*Po  therefore  satisfies 

IIPWIU  =  (  COS  (~)  )^l|Polloo  > 

Therefore,  ||P(£)||oo  is  larger  than  ^ re  so  long  as  j^/^n(cosin)Y  >  \£i  that 
is,  so  long  as 


(  log(e  ln)  -  log(5y/2) 
log  ( cos  (  ~  )  ) 

Exercise  E4.7  asks  the  reader  to  show  that  the  asymptotics  of  this  bound 
corresponds  to  the  lower  bound  in  fact  (i). 


Now  consider  the  case  when  ^ld(^)  is  not  connected  at  Pq.  Note  that  the 
connected  components  do  not  change  along  the  network  evolution.  There¬ 
fore,  the  previous  reasoning  can  be  applied  to  each  connected  component. 
Since  the  number  of  agents  in  each  connected  component  is  strictly  less  that 
n,  the  time  complexity  can  only  but  improve.  Therefore,  we  conclude  that 

TC(Tmdzvs,CCCIRCUMCENTER)  £  &(n2  log(ne  1)). 


Fact  (Hi).  Recall  from  the  proof  of  Theorem  4.6.2  that  the  evolution  under 
CCpll-crcmcntr  of  the  robotic  network  5oo-disk  (in  d  dimensions)  can  be  al¬ 
ternatively  described  as  the  evolution  under  CCCIRCUMcenter  of  d  robotic  net¬ 
works  5disk  in  M,  see  Exercise  E4.4.  Fact  (iii)  now  follows  from  fact  (i).  ■ 


4.6.4  Proof  sketch  of  Theorem  4.21 

Here,  we  only  provide  a  sketch  of  the  proof  of  Theorem  4.21.  Fact  (i)  is  a 
consequence  of  the  control  function  definition  of  the  NONCONVEX  CIRCUM¬ 
CENTER  law  in  Section  4.3.4  and  Lemma  4.11.  Fact  (ii)  follows  from  the 
fact  that  the  law  CCNONconvex  circumcenter  (with  control  magnitude  bounds) 
achieves  the  e-rendezvous  task  '7^_rndzvs  and  fact  (i). 

In  order  to  show  that,  on  the  network  cSvis_disk,  the  law  CCNONconvex  circumcenter 
(with  control  magnitude  bounds)  achieves  the  e-rendezvous  task  T£_ rndzvs, 
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one  can  follow  the  same  overapproximation  strategy  that  we  used  in  the 
proof  of  Theorem  4.17,  STEP  1,  i.e., 

(a)  define  a  set- valued  dynamical  system  (Qg,  Qg,  T)  such  that  the  evo¬ 
lutions  of  (5vis-disk;  CCnonconvex  circumcenter)  starting  from  an  initial 
configuration  where  f/vis-disk,Q«  is  connected  are  contained  in  the  set 
of  evolutions  of  the  set-valued  dynamical  system;  and 

(b)  establish  that  any  evolution  of  {Q7g,Qrg,T)  converges  to  a  point 
in  diag^^)  (note  that  the  point  might  be  different  for  different 
evolutions) . 

We  refer  to  [Ganguli  et  ah,  2007b]  for  a  detailed  development  of  this  proof 
strategy.  Here,  we  only  remark  that  in  order  to  carry  out  (b),  the  proof  uses 
the  LaSalle  Invariance  Principle  in  Theorem  1.18  with  the  perimeter  of  the 
relative  convex  hull  of  a  set  of  points  as  Lyapunov  function. 


4.7  EXERCISES 

E4.1  (Maintaining  connectivity  of  sparser  networks).  Prove  Lemma  4.8. 

Hint:  Use  Lemma  f.2  and  the  fact  that  Q  and  C/disk(r)  have  the  same  connected 
components. 

E4.2  (Maintaining  network  line-of-sight  connectivity).  Prove  Lemma  4.11. 
Hint:  Use  Proposition  f.9. 

E4.3  (Enforcing  range-limited  links  is  unnecessary  for  the  circumcenter  law 

on  R).  Let  V  =  {pi, . . .  ,pn}  £  F(R).  For  r  £  R>o,  we  work  with  the  r-disk 
proximity  graph  Sdisk(r)  evaluated  at  V.  Let  i  £  {l,...,n}  and  consider  the 
circumcenter  of  the  set  comprised  of  pi  and  of  its  neighbors, 

(Pgoal)i  =  CC({Pi}  U  A7;d 

isk  (r).Pi  <?))■ 

Show  the  following  holds. 

(i)  If  ^  and  Pj  are  neighbors  in  <5disk(r),  then  (pgoa i)i  belongs  to  f?(p* ,  §); 

(ii)  if  Pi  and  pj  are  neighbors  in  Qdisk(r)  and  pi  <Pj,  then  (pgoai)i  <  (pgoaib; 
and 

(iii)  discuss  the  implication  of  (i)  and  (ii)  in  the  execution  of  the  circumcen¬ 
ter  law  on  the  1-dimensional  space  R. 

Hint:  Express  (pgoai)i  as  a  function  of  the  position  of  the  leftmost  and  rightmost 
points  among  the  neighbors  of  pi. 

E4.4  (Enforcing  range-limited  links  is  unnecessary  for  the  pll-crcmcntr  law). 

Let  V  =  {pi,  ■  ■  ■  ,Pn}  £  F(Rd)  and  r  £  R>o.  For  k  £  {l,...,d},  denote  by 
ivk  ■  Rd  — >  R  the  projection  onto  the  fcth  component.  Do  the  following  tasks. 

(i)  Show  that  pi  and  Pj  are  neighbors  in  C/oo-disk(r)  if  and  only  if,  for  all 
k  £  {1, . . .  ,d},  nk(pi)  and  nk(pj)  are  neighbors  in  C/disk (r); 

(ii)  for  S  C  Rd,  justify  that  the  parallel  circumcenter  PCC(S')  €  Rd  of  S  can 
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be  described  as 

ttRPCC (S'))  =  CC(t rk(S)),  for  k  £  {1, . . . ,  d}- 

(iii)  use  (i),  (ii),  and  Exercise  E4.3(i)  to  justify  that  no  constraint  is  required 
to  maintain  connectivity  of  the  oo-disk  graph  in  the  pll-CRCMCNTR  law. 

In  other  words,  show  that  if  pi  and  Pj  are  neighbors  in  Sco-disk(r),  then 
PCC({pi}  U  Afgoo.diskW,Pi(P))  and  PCC(fe}  U Mgao.dis^r),Pj  (V))  are  also 
neighbors  in  Soo-disk(r). 

E4.5  (Finite-time  convergence  of  the  circumcenter  law  on  Sdisk).  For  Umax,  r  £ 
R>o,  let  a  =  min{umax,  §}■  Let  V  =  {pi, . . .  ,pn}  £  F(Rd),  and  assume  there  exists 
p  £  Rd  such  that 


{pi,...,pn}  C  B(p,a). 


Do  the  following  tasks. 

(i)  Show  that  C/disk  (r)  evaluated  at  {pi, . . .  ,pn}  is  the  complete  graph; 

(ii)  justify  why  || ps  -  CC({pi, . . .  ,p„})||2  <  a,  for  all  i  £  (1, . . .  ,n}; 

(iii)  show  that  CC({pi, . . .  ,p„})  £  Xdisk{pi,V)  C\  B(pi  5  ^max)  ? 

(iv)  what  is  the  evolution  of  (Sdisk,  CCcircumcenter)  (with  control  magnitude 
bounds)  starting  from  (pi, . . .  ,pn)? 

E4.6  (Variation  of  the  circumcenter  law).  Let  V  =  {pi, . . .  ,pn}  £  F(Rd).  For 
r  £  R>o,  we  work  with  the  r-disk  proximity  graph  f/disk(r)  evaluated  at  V.  For 
each  i  £  {1, . . .  ,n},  consider  the  circumcenter  of  the  set  comprised  of  pi  and  of 
the  mid-points  with  its  neighbors, 

(Pgoal)i  =  CC({JH}  U  |  Pj  £  AfSdiBk(r),PiC P)}). 

Show  the  following  holds. 

(i)  If  pi  and  pj  are  neighbors  in  C/disk(r),  then  (pgoai)i  and  (pgoai)j  are  neigh¬ 
bors  in  C/disk  (r); 

(ii)  use  (i)  to  design  a  control  and  communication  law  on  the  network  Sdisk 
in  Rd  that,  while  not  enforcing  any  connectivity  constraints,  preserves  all 
neighboring  relationships  in  Sdisk  (r)  and  achieves  the  e-rendezvous  task 

i^e-rndzvs ,  aild 

(iii)  justify  why  the  law  designed  in  (ii)  does  not  achieve  the  exact  rendezvous 

task  Srndzvs  ■ 

E4.7  (Asymptotics  of  the  lower  bound  in  Theorem  4.18(h)).  Show  that,  as 

n  — ►  +oo, 

Me-'s)  _ 

-log(cOs(^))  7T2V 

Use  this  fact  to  complete  the  proof  of  the  lower  bound  in  the  proof  of  Theo¬ 
rem  4.18(h). 

Hint:  Use  the  Taylor  series  expansion  of  log(cos(x))  at  x  =  0. 
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Chapter  Five 

Deployment 


The  aim  of  this  chapter  is  to  present  various  solutions  to  the  deployment 
problem.  The  deployment  objective  is  to  optimally  place  a  group  of  robots 
in  an  environment  of  interest.  The  approach  taken  here  consists  of  identify¬ 
ing  aggregate  functions  that  measure  the  quality-of-deployment  of  a  given 
network  configuration  and  designing  control  and  communication  laws  that 
optimize  these  measures. 

The  variety  of  algorithms  presented  in  the  chapter  stems  from  two  rea¬ 
sons.  First,  different  solutions  arise  from  the  interplay  between  the  spatially 
distributed  character  of  the  coordination  algorithms  and  the  limited  sensing 
and  communication  capabilities  of  the  robotic  network.  As  an  example,  dif¬ 
ferent  solutions  are  feasible  when  agents  have  range-limited  communication 
capabilities  or  when  agents  have  omnidirectional  line-of-sight  visibility  sen¬ 
sors.  Second,  there  is  no  universal  notion  of  deployment.  Different  scenarios 
give  rise  to  different  ways  of  measuring  what  constitutes  a  good  deploy¬ 
ment.  As  an  example,  a  robotic  network  might  follow  a  different  strategy 
depending  on  whether  it  has  information  about  areas  of  importance  in  the 
environment  or  not.  In  the  first  case,  by  incorporating  the  knowledge  on 
the  environment.  In  the  second,  by  assuming  a  worst-case  scenario,  where 
important  things  can  be  happening  precisely  at  the  furthest  away  location 
from  the  network  configuration. 

Our  exposition  here  follows  [Cortes  et  al.,  2004,  2005,  Cortes  and  Bullo, 
2005].  Our  approach  makes  extensive  use  of  the  multicenter  functions  from 
geometric  optimization  introduced  in  Chapter  2.  It  is  not  difficult  to  syn¬ 
thesize  continuous-time  gradient  ascent  algorithms  using  the  smoothness  re¬ 
sults  presented  in  Section  2.3,  and  characterize  its  asymptotic  convergence 
properties  (as  we  ask  the  reader  to  do  in  Exercises  E2.ll  and  E2.12).  How¬ 
ever,  following  the  robotic  network  model  of  Chapter  3,  we  are  interested 
in  discrete-time  algorithms.  In  general,  gradient  ascent  algorithms  imple¬ 
mented  in  discrete  time  require  the  selection  of  appropriate  step  sizes  that 
guarantee  the  monotonic  evolution  of  the  objective  function.  This  is  usually 
accomplished  via  line  search  procedures,  e.g.,  see  [Bertsekas  and  Tsitsiklis, 
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1997].  In  this  chapter  we  show  that  the  special  geometric  properties  of  the 
multicenter  functions  and  their  gradients  allow  us  to  identify  natural  target 
locations  for  the  robotic  agents  without  the  need  to  perform  any  line  search. 

The  chapter  is  organized  as  follows.  In  the  first  section  of  the  chapter 
we  formally  define  the  notions  of  deployment  via  task  maps  and  multicenter 
functions.  In  the  next  section,  we  present  motion  coordination  algorithms  to 
achieve  each  deployment  task.  Specifically,  we  introduce  control  and  com¬ 
munication  laws  based  on  various  notions  of  geometric  centers.  We  present 
convergence  and  complexity  results  on  the  proposed  algorithms,  along  with 
simulations  illustrating  our  analysis.  The  third  section  presents  various  sim¬ 
ulations  of  the  proposed  motion  coordination  algorithms.  We  end  the  chap¬ 
ter  with  three  sections  on,  respectively,  bibliographical  notes,  proofs  of  the 
results  presented  in  the  chapter,  and  exercises.  Throughout  the  exposition, 
we  make  extensive  use  of  proximity  graphs,  multicenter  functions,  and  ge¬ 
ometric  optimization.  The  convergence  and  complexity  analysis  are  based 
on  the  LaSalle  Invariance  Principle  and  on  linear  dynamical  systems  defined 
by  Toeplitz  matrices. 


5.1  PROBLEM  STATEMENT 

Here,  we  introduce  various  notions  of  deployment.  Let  S  =  ({1, . . . ,  n},  1Z,  Ecmm) 
be  a  uniform  robotic  network,  where  the  robots’  physical  state  space  is  a 
(simple  convex)  polytope  QcR'*  that  describes  an  environment  of  interest. 

We  define  our  notions  of  deployment  relying  upon  the  geometric  optimiza¬ 
tion  problems  discussed  in  Section  2.3.  Loosely  speaking,  we  aim  to  deploy 
the  robots  in  such  a  way  as  to  optimize  one  of  the  multicenter  functions,  such 
as  the  expected- value  multicenter  function  Tiex p,  the  disk-covering  multicen¬ 
ter  function  Tide,  or  the  sphere-packing  multicenter  function  Hsp.  Indeed 
these  functions  can  be  interpreted  as  quality-of-service  measures  for  different 
scenarios.  In  order  to  formally  define  the  task  maps  encoding  the  deploy¬ 
ment  objective,  we  take  the  following  approach:  since  the  optimizers  of  these 
measures  are  critical  points,  and  these  critical  points  are  network  configu¬ 
rations  that  make  the  gradients  vanish,  we  define  the  task  map  to  take  the 
true  value  at  these  configurations. 


5.1.1  The  distortion,  area,  and  mixed  distortion-area  deployment  tasks 

Here  we  define  various  notions  of  deployment  originating  from  the  expected- 
value  multicenter  function  TCex p.  Recall  the  concepts  of  density  and  perfor¬ 
mance  introduced  in  Section  2.3.  Let  <f>  :  Md  — >  M>o  be  a  density  func¬ 
tion  on  Md  with  support  Q.  One  can  interpret  cj>  as  a  function  measur- 
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ing  the  probability  that  some  event  takes  place  over  the  environment.  Let 
/  :  M>o  — >  R  be  a  performance,  i.e. ,  a  non-increasing  and  piecewise  dif¬ 
ferentiable  function  possibly  with  finite  jump  discontinuities.  Performance 
functions  describe  the  utility  of  placing  a  robot  at  a  certain  distance  from 
a  location  in  the  environment.  Here,  we  restrict  our  attention  to  the  cases 
f(x)  =  — x 2  (distortion  problem),  f(x)  =  l[oiQ](x),  a  E  M>o  (area  problem), 
and  f(x)  =  —x2  lr0ai ix)  ~  a2  ■  lw+oofOOi  with  a  E  R>o  (mixed  distortion- 
area  problem). 


For  e  G  K>o,  the  e-distortion  deployment  task  7^-distor-dpiy  :  Qn  — >  {true,  false} 
is  defined  as 


% 


£-distor- 


dply(-P) 


true,  if  ||pM  -  CM0(HW(P))||2  <  e,  i  G  {1, . . . ,  n}, 
false,  otherwise, 


where  V^(P)  denotes  the  Voronoi  cell  of  robot  i,  and  CM^,(I/W(p))  de¬ 
notes  its  centroid  computed  according  to  cj>,  see  Section  2.1.  In  other  words, 
^-distor-dpiy  is  true  for  those  network  conhgurations  where  each  robot  is  suf¬ 
ficiently  close  to  the  centroid  of  its  Voronoi  cell.  According  to  Theorem  2.17, 
centroidal  Voronoi  configurations  are  the  critical  points  of  the  multicenter 
function  Pdistor- 


For  r,  e  G  M>o,  the  £-r-area  deployment  task  Te.r. area-dpiy  :  Qn  — >  {true,  false} 
is  defined  as  follows: 


7s-r- area-dply  (H) 


true, 

false, 


if  II  /vw(P)naB(p[«i,|)  nout,s(Pw,p(9)</>(9)^||2  <  s,  i  €  {1,  •  •  ■  ,n}, 
otherwise. 


In  other  words,  Te-r- area-dply  is  true  for  those  network  configurations  where 
each  agent  is  sufficiently  close  to  a  local  maximum  for  the  area  of  its  (j -limited 

Voronoi  cell  Vr\p)  =  1/M  (p)  n  £>(/jW ,  (j)  at  fixed  V^(P).  According  to 
2  Z 

Theorem  2.17,  (^-limited  area-centered  Voronoi  configurations  are  the  critical 
points  of  the  multicenter  function  ?fareai  r. 


For  r,  e  G  R>o,  the  e-r- distortion- area  deployment  task  ^-r-distor-area-dpiy  : 
Qn  — »  {true,  false}  is  defined  as  follows: 


% 


e-r-distor-area- 


,-dply  (P) 


i 


true, 

false, 


if  ||PM  _  CM0(v|](P)))||2  <  e,  i  G  {1, . . .  ,n}, 
otherwise. 


In  other  words,  '7{-r-distor-area-dply  is  true  for  those  network  configurations 
where  each  robot  is  sufficiently  close  to  the  centroid  of  its  ((-limited  Voronoi 
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cell.  According  to  Theorem  2.17,  | -limited  centroidal  Voronoi  configurations 
are  the  critical  points  of  the  multicenter  function  Tfdistor-area, r  ■ 


5.1.2  The  disk-covering  and  sphere-packing  deployment  tasks 


Here  we  provide  two  additional  notions  of  deployment  based  on  the  multi¬ 
center  functions  Hdc  and  Hsp,  respectively. 


For  e  E  M>o,  the  e- disk- covering  deployment  task  T^-dc-dpiy  :  Qn 
{true,  false}  is  defined  as 


^e-dc-dply  (P) 


true,  if  || pH  -  CC(VrW(P))||2  <  e,  i  G  {1, . . .  ,n}, 
false,  otherwise, 


where  CC (VW(P))  denotes  the  circumcenter  of  the  Voronoi  cell  of  robot  i. 
In  other  words,  ^-dc-dply  is  true  for  those  network  configurations  where  each 
robot  is  sufficiently  close  to  the  circumcenter  of  its  Voronoi  cell.  Accord¬ 
ing  to  Section  2.3.2,  circumcenter  Voronoi  configurations  are,  under  certain 
technical  conditions,  critical  points  of  the  multicenter  function  Tide- 


For  e  E  M>o,  the  e- sphere-packing  deployment  task  7}_sp_dpiy  :  Qn 
{true,  false}  is  defined  as 


^e-sp-dply  (P) 


true,  if  dist2(p^,IC(VW(P)))  <  e,  *e{l,...,n}, 
false,  otherwise, 


where  IC(V^(P))  denotes  the  incenter  set  of  the  Voronoi  cell  of  robot  i.  In 
other  words,  ^-sp-dply  is  true  for  those  network  configurations  where  each 
robot  is  sufficiently  close  to  the  incenter  set  of  its  Voronoi  cell.  According 
to  Section  2.3.3,  incenter  Voronoi  configurations  are,  under  certain  technical 
conditions,  critical  points  of  the  multicenter  function  7is p. 


5.2  DEPLOYMENT  ALGORITHMS 

In  this  section  we  present  algorithms  that  can  be  used  by  a  robotic  network  to 
achieve  the  various  notions  of  deployment  introduced  in  the  previous  section. 
Throughout  the  discussion  we  mainly  focus  on  the  uniform  networks  and 
5ld  of  locally-connected  first-order  agents  in  a  polytope  Q  C  M.d  with  the 
Delaunay  and  r-limited  Delaunay  graphs  as  communication  graphs,  respec¬ 
tively;  these  robotic  networks  were  introduced  in  Example  3.4. 

All  the  laws  presented  in  this  chapter  share  a  similar  structure,  that  we 
loosely  describe  as  follows. 
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[Informal  description]  At  each  communication  round  each  agent 
performs  the  following  tasks:  (i)  it  transmits  its  position  and 
receives  its  neighbors’  positions;  (ii)  it  computes  a  notion  of  geo¬ 
metric  center  of  its  own  cell  determined  according  to  some  notion 
of  partition  of  the  environment.  Between  communication  rounds, 
each  robot  moves  toward  this  center. 


The  notions  of  geometric  center  and  of  partition  of  the  environment  are 
different  for  each  algorithm,  and  specifically  tailored  to  the  deployment  task 
at  hand.  Let  us  examine  them  for  each  case. 


5.2.1  Geometric-center  laws 

We  present  control  and  communication  laws  defined  on  the  network  All 
the  laws  share  in  common  the  use  of  the  notion  of  Voronoi  partition  of  the 
environment  Q.  We  first  introduce  the  Vrn-cntrd  law,  which  makes  use  of 
the  notion  of  centroid  of  a  Voronoi  cell.  We  then  propose  two  sets  of  varia¬ 
tions  to  this  law.  On  the  one  hand,  we  present  the  VRN-CNTRD-DYNMCS  law, 
which  implements  the  same  centroid  strategy  on  a  network  of  planar  vehi¬ 
cles.  On  the  other  hand,  we  introduce  the  Vrn-CRCMCNTR  and  Vrn-ncntr 
laws,  which  instead  make  use  of  the  notions  of  circumcenter  and  incenter  of 
a  Voronoi  cell,  respectively. 


5. 2. 1.1  Voronoi-centroid  control  and  communication  law 

Here  we  define  the  Vrn-cntrd  control  and  communication  law  for  the 
network  5d>  that  we  denote  by  CCyRN_CNTRD.  This  law  was  introduced 
by  Cortes  et  al.  [2004],  We  formulate  the  algorithm  using  the  description 
model  of  Chapter  3.  The  law  is  uniform,  static,  and  data-sampled,  with 
standard  message-generation  function. 


Robotic  Network:  with  discrete-time  motion  model  (4.1.1) 

in  Q,  with  absolute  sensing  of  own  position 

Distributed  Algorithm:  VRN-CNTRD 
Alphabet:  A  =  MdU{null} 

function  msg (p,i) 
l:  return  p 

function  ctl (p,y) 


221 


“Distributed  Control  of  Robotic  Networks”  by  F.  Bullo,  J.  Cortes  and  S.  Martinez 
Copyright  ©  2006-2008.  Manuscript  under  contract.  This  version:  October  27,  2008 


DCRN  October  27,  2008 


1:  V  :=  Q  0  (  fll-ftp.Prcvd  I  for  a11  non-null  prcvd  S  y }) 
2:  return  CM^(V)  —  p 


Recall  that  Hp  x  is  the  half-space  of  points  q  in  Md  with  the  property  that 
| \q  —  p || 2  <  || q  —  x\\2-  Since  the  centroid  of  a  Voronoi  cell  belongs  to  the 
cell  itself,  the  robots  never  leave  the  set  Q  or,  in  other  words,  the  set  Qn 
is  positively  invariant  with  respect  to  the  control  and  communication  law 
CCyRN_CNTRD.  Moreover,  note  that  the  direction  of  motion  specified  by  the 
control  function  ctl  coincides  with  the  gradient  of  the  distortion  multicenter 
function  Hdistor-  Hence,  this  law  prescribes  a  gradient  ascent  strategy  for 
each  robot  that,  as  we  will  show  later,  monotonically  optimizes  Tfdistor- 


5. 2. 1.2  Voronoi-centroid  law  on  planar  vehicles 


Next,  we  provide  an  interesting  variation  of  the  Vrn-cntrd  law  defined  on 
the  network  5vehicies  introduced  in  Example  3.5.  Accordingly,  we  adopt  the 
continuous-time  motion  model  for  the  unicycle  vehicle 

pN(t)  =  v®(t)  (cos(6»W(f)),  sin (0N(t))), 

=  ie{l,...,n},  (5.2.1) 

where  we  assume  that  forward  and  angular  velocities  are  upper  bounded.  We 
refer  to  this  law  as  Vrn-cntrd-dynmcs,  and  we  denote  it  by  CCvRn-cntrd-dynmcs- 
The  law  was  introduced  by  Cortes  et  al.  [2004],  is  uniform  and  static,  but 
not  data-sampled. 


Robotic  Network:  5vehides  with  motion  model  (5.2.1)  in  Q, 
with  absolute  sensing  of  own  position 

Distributed  Algorithm:  VRN-CNTRD-DYNMCS 
Alphabet:  A  =  M2  Ujnull} 

function  msg ((p,9),i) 

i:  return  p 

function  ctl((p,  6),  (pSmPld,  0Smpld),  y) 

1:  \  :=  Q  n  (  ni-^Psmpld.Prcvd  I  for  all  non- null  prcvd  £  y}) 

2:  v  :=  kprop (cos  9 ,  sin  9)  ■  (p  -  CM^(1/)) 

3.  w  ._  2k  arctan  (~sin^  cos  6)  •  (p  -  CM^H)) 

3-  w  —  ^prop  arctan  ^  :  g  _  CM^(R)) 

4:  return  (v,u i) 


This  algorithm  is  illustrated  in  Figure  5.1. 
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Figure  5.1  Illustration  of  the  execution  of  Vrn-cntrd-dynmcs.  Each  row  of  plots  rep¬ 
resents  an  iteration  of  the  law.  At  each  round,  each  agent  first  computes  its 
Voronoi  cell,  then  determines  the  centroid,  and  then  moves  towards  it. 


In  the  above  description,  we  require  the  feedback  gain  fcprop  to  belong 
to  the  interval  ]0,  max{^-  diangQ)}]’  This  guarantees  that  the  controls  v,  uj  in 
the  definition  of  ctl  belong  to  the  closed  interval  [—1,1],  and  are  therefore, 
implementable  in  the  unicycle  and  the  differential  drive  robot  models. 

Remark  5.1  (Vehicles  with  general  dynamics).  The  definition  of  the 
control  function  ctl  is  based  on  the  stabilizing  feedback  law  by  Astolh  [1999]. 
The  robot  position  p  is  guaranteed  to  monotonically  approach  the  target 
position  CM^(V).  In  general,  the  Vrn-cntrd-dynmcs  law  can  be  imple¬ 
mented  over  a  network  of  vehicles  with  arbitrary  dynamics,  as  long  as  these 
vehicles  are  capable  of  strictly  decreasing  the  distance  to  any  specified  po¬ 
sition  in  Q  in  the  time  intervals  between  communication  rounds.  Under 
this  assumption,  the  Vrn-cntrd-dynmcs  law  enjoys  the  same  convergence 
guarantees  as  the  Vrn-cntrd  law,  see  Theorem  5.5  below.  • 


5.2. 1.3  Voronoi-circumcenter  control  and  communication  law 


Here  we  define  the  VRN-CRCMCNTR  control  and  communication  law  for  the 
network  5d,  that  we  denote  by  CCvrn-crcmcntr-  This  law  was  introduced 
by  Cortes  and  Bullo  [2005].  The  law  is  uniform,  static,  and  data-sampled, 
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with  standard  message-generation  function. 


Robotic  Network:  with  discrete-time  motion  model  (4.1.1) 

in  Q,  with  absolute  sensing  of  own  position 

Distributed  Algorithm:  VRN-CRCMCNTR 
Alphabet:  A  =  M.d  U{null} 

function  msg(p,  *) 
i:  return  p 

function  ctl (p,y) 

1:  V  :=Q  n  ( f|{#P,Pr=vd  I  for  a11  non-null  prcvd  G  y}) 

2:  return  CC(V)  —  p 


Since  the  circumcenter  of  a  Voronoi  cell  belongs  to  the  cell  itself,  the 
set  Qn  is  positively  invariant  with  respect  to  the  control  and  communication 
law  CCvrn_CRCMcntr-  From  a  geometric  perspective,  this  law  makes  sense 
as  a  strategy  to  optimize  the  disk-covering  multicenter  function  Tide-  From 
Section  2.1.3,  for  fixed  V,  the  circumcenter  location  minimizes  the  cost  given 
by  the  maximum  distance  to  all  points  in  V.  From  Section  2.3.2,  Tide  can 
be  expressed  (2.3.12)  as  the  maximum  over  the  network  of  each  robot’s 
individual  cost. 


5.2. 1.4  Voronoi-incenter  control  and  communication  law 


Here  we  define  the  Vrn-ncntr  control  and  communication  law  for  the 
network  5d,  that  we  denote  by  CCyRN_NCNTR.  This  law  was  introduced 
by  Cortes  and  Bullo  [2005].  The  law  is  uniform,  static,  and  data-sampled, 
with  standard  message-generation  function. 


Robotic  Network:  5d  with  discrete-time  motion  model  (4.1.1) 
in  Q,  with  absolute  sensing  of  own  position 

Distributed  Algorithm:  VRN-NCNTR 
Alphabet:  A  =  M.d  U{null} 

function  msg(p,  i) 
i:  return  p 

function  ctl (p,y) 

I:  V  ■=  Q  n  (  f]{HP,P^d  |  for  all  non-null  prcvd  £  y}) 

2:  return  x  E  IC (V)-p 
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Since  the  incenter  set  of  a  Voronoi  cell  belongs  to  the  cell  itself,  the  set  Qn 
is  positively  invariant  with  respect  to  the  control  and  communication  law 
CCvrn-ncntr-  From  a  geometric  perspective,  this  law  makes  sense  as  a  strat¬ 
egy  to  optimize  the  sphere-packing  multicenter  function  7isp.  From  Sec¬ 
tion  2.1.3,  for  fixed  V,  the  incenter  locations  maximize  the  cost  given  by  the 
minimum  distance  to  the  boundary  of  V.  From  Section  2.3.3,  7isp  can  be  ex¬ 
pressed  (2.3.15)  as  the  minimum  over  the  network  of  each  robot’s  individual 
cost. 

Remark  5.2  ( “Move-toward-furthest-vertex”  and  “away-from-clos- 
est-neighbor”  coordination  algorithms).  Consider  the  coordination  al¬ 
gorithm  where,  at  each  time  step,  each  robot  moves  towards  the  furthest- 
away  vertex  of  its  own  Voronoi  cell.  Alternatively,  consider  the  coordi¬ 
nation  algorithm  where,  at  each  time  step,  each  robot  moves  away  from 
its  closest  neighbor.  Both  coordination  algorithms  define  maps  which  de¬ 
pend  discontinuously  on  the  robots’  positions.  Cortes  and  Bullo  [2005]  study 
the  asymptotic  behavior  of  these  laws,  and  show  that  the  “move-toward- 
furthest-vertex”  algorithm  monotonically  optimizes  the  multicenter  function 
77dc;  while  the  “away-from-closest-neighbor”  algorithm  monotonically  opti¬ 
mizes  the  multicenter  function  TCsp.  • 


5.2.2  Geometric-center  laws  with  range-limited  interactions 

Next,  we  present  two  control  and  communication  laws  on  the  network  5ld- 
Both  laws  prescribe  a  geometric  centering  strategy  for  each  robot  and  ac¬ 
complish  specific  forms  of  expected- value  optimization.  The  Lmtd-Vrn- 
nrml  law  optimizes  the  area  multicenter  function  7fa rea,|,  while  the  Lmtd- 
Vrn-cntrd  law  optimizes  the  mixed  distortion-area  multicenter  function 

Ffdistor-area,  ^  • 


5.2.2. 1  Limited-Voronoi-normal  control  and  communication  law 

Here  we  define  the  Lmtd-Vrn-nrml  control  and  communication  law  for 
the  network  5ld-  This  law  was  introduced  by  Cortes  et  al.  [2005].  The 
Lmtd-Vrn-nrml  law,  that  we  denote  by  CClmtd-Vrn-nrmlj  uses  the  notion 
of  ([-limited  Voronoi  partition  inside  Q.  The  law  is  uniform,  static,  and 
data-sampled,  with  standard  message-generation  function. 


Robotic  Network:  5ld  with  discrete-time  motion  model  (4.1.1) 
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with  absolute  sensing  of  own  position,  and 
with  communication  range  r,  in  Q 

Distributed  Algorithm:  LMTD-VRN-NRML 
Alphabet:  A  =  MdU{null} 

function  msg (p,i) 
i:  return  p 

function  ctl (p,y) 

I:  V  :=  Q  0  (  f|{#P,Prcvd  I  for  a11  non-null  pIcvd  E  y}) 

2:  v  :=  fvndB(P^)nout,B(P,^^dq 

3:  A*  :=  max{A  |  8  >  Jyn^^p+gv  ^  4>(q)dq  is  strictly  increasing  on  [0,  A]} 

4:  return  A*u 


Note  that  the  direction  of  motion  v  specified  by  the  control  function  ctl 
coincides  with  the  gradient  of  the  multicenter  function  Ha rea,§-  The  pa¬ 
rameter  A*  corresponds  to  performing  a  line  search  procedure  along  the 
direction  v. 

The  control  function  has  the  property  that  the  point  p  +  ctl  (p,  y)  is  guar¬ 
anteed  to  be  in  the  interior  of  V.  This  can  be  justified  by  noting  that  for 
fixed  V,  the  gradient  of  the  function  p  — >  Jynj^(p  (j)(q)dq  at  points  in  the 

boundary  of  V  is  non- vanishing  and  points  toward  the  interior  of  V  (cf.  Ex¬ 
ercise  E2.4).  As  a  consequence,  the  line  search  procedure  terminates  before 
reaching  the  boundary  of  V.  This  discussion  guarantees  that  the  set  Qn 
is  positively  invariant  with  respect  to  the  control  and  communication  law 
CClmtd-V  rn-nrml  • 


5. 2. 2. 2  Limited-Voronoi-centroid  control  and  communication  law 

Here  we  define  the  Lmtd-Vrn-cntrd  control  and  communication  law  for 
the  network  5ld-  This  law  was  introduced  by  Cortes  et  al.  [2005].  The 
Lmtd-Vrn-cntrd  law,  that  we  denote  by  CClmtd-Vrn-cntrdj  uses  the  notion 
of  ^-limited  Voronoi  partition  inside  Q  and  of  centroid  of  the  individual  |- 
limited  Voronoi  cells.  The  law  is  uniform,  static,  and  data-sampled,  with 
standard  message-generation  function. 


Robotic  Network:  5ld  with  discrete-time  motion  model  (4.1.1) 
with  absolute  sensing  of  own  position,  and 
with  communication  range  r,  in  Q 
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Distributed  Algorithm:  Lmtd-Vrn-CNTRD 
Alphabet:  A  =  MdU{null} 

function  msg (p,i) 
l:  return  p 

function  ctl (p,y) 

1:  V  :=  Q  0  B(p,  §)  O  (  f|{#P,Prcvd  I  for  a11  non-null  prcvd  G  y }) 
2:  return  CM^V)  —  p 


The  centroid  of  a  ^-limited  Voronoi  cell  belongs  to  the  cell  itself,  and  this 
fact  guarantees  that  the  set  Qn  is  positively  invariant  with  respect  to  the 
control  and  communication  law  CClmtd-Vrn-cntrd-  Moreover,  note  that  the 
direction  of  motion  specified  by  the  control  function  ctl  coincides  with  the 
gradient  of  the  multicenter  function  77distor-area,§  • 

Remark  5.3  (Relative  sensing  version).  It  is  possible  to  implement  the 
limited- Voronoi-normal  and  limited- Voronoi-centroid  laws  as  static  relative¬ 
sensing  control  laws  on  the  relative-sensing  network  57-  k.  This  is  a  conse¬ 
quence  of  the  fact  that  the  r-limited  Delaunay  graph  is  spatially  distributed 
over  the  r-disk  graph  (cf.  Theorem  2.7 (iii)) .  Let  us  present  one  of  these 
examples  for  completeness. 


Relative  Sensing  Network:  55?  k  with  motion  model  (4.1.2) 
in  Q,  no  communication,  relative  sensing  for  robot  i  given  by: 
robot  measurements  y  contains  p G  B( O2,  r)  for  all  j  ^  i 
environment  measurement  is  ye nv  =  (Qe)i  H  B(0d,  r) 

Distributed  Algorithm:  RELATIVE-SENSING  Lmtd-Vrn-CNTRD 

function  ctl(y,  ye nv) 

i:  V  :=  ye nv  n  B(0d,  §)  n  (  Dl-^o d,PanBd  |  for  all  non-null  psnsd  G  y}) 
2:  return  CM^D) 


Note  that  only  the  positions  of  neighboring  robots  in  the  r-limited  Delau¬ 
nay  graph  have  an  effect  in  the  computation  of  the  set  V.  • 

Remark  5.4  (Range-limited  version  of  Vrn-cntrd).  The  Lmtd-Vrn- 
nrml  and  Lmtd-Vrn-cntrd  laws  can  be  combined  into  a  single  control  and 
communication  law  to  synthesize  an  algorithm  that  monotonically  optimizes 
the  function  7fdistor-area,  with  b  =  —  diam(Q)2.  The  law  is  uniform,  static, 
and  data-sampled,  with  standard  message-generation  function.  This  law, 
that  we  term  Rng-Vrn-cntrd,  is  uniform,  static,  and  data-sampled,  with 
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standard  message-generation  function. 


Robotic  Network:  5ld  with  discrete-time  motion  model  (4.1.1) 
in  Q,  with  absolute  sensing  of  own  position,  and 
with  communication  range  r 

Distributed  Algorithm:  Rng-Vrn-cntrd 
Alphabet:  A  =  M.d  U{null} 

function  msg(p,  *) 
i:  return  p 


function  ctl (p,y) 

1:  V  :=  Q  0  (  f|{#P,Pr=vd  I  for  a11  non-null  pTCvd  €  y}) 

2:  Vl  :=  2A^(7nI(p,|))(CM^(7nI(p,  §))  -p) 

3:  V2  :=  (diam(Q)2  -  f)  fvndB(p,r)  nont, B(p^^dq 

4:  A*  :=  argmax{J  i-»  7iv(p  +  S(vi  +  V2),  B(p  +  5(vi  +  V2),  §)) 

is  strictly  increasing  on  (0,e)} 


5:  return  A*(ui  +  V2) 


In  the  above  algorithm,  Tiy  is  defined  as  in  Exercise  E2.7.  For  a  point 
p  G  V  and  a  closed  ball  B  centered  at  a  point  in  V  with  radius  | ,  we  define 

Bv(p,B)  =  -  I  _\\q  -  p\\l(j)(q)dq  -  f  _  a2(p(q)dq. 

JvnB  Jvn(Q\B) 

The  Rng-Vrn-cntrd  law  is  relevant  because  of  the  following  discussion. 
Recall  from  Proposition  2.18  that  the  general  mixed  distortion-area  multi¬ 
center  function  can  be  used  to  provide  constant-factor  approximations  of 
the  distortion  function  Tfdistor-  As  we  discussed  in  Section  2.3.1,  robots 
with  range- limited  interactions  cannot  implement  Vrn-cntrd  because,  for 
a  given  r  e  M>o,  Qd  is  not  in  general  spatially  distributed  over  f?disk(r)  (cf. 
Remark  2.11).  However,  robotic  agents  with  range-limited  interactions  can 
implement  the  computations  involved  in  Lmtd-Vrn-nrml  and  Lmtd-Vrn- 
CNTRD,  and  hence  can  optimize  7fdistor-area,^,b-  Assuming  r  <  2diam(Q),  it 
is  fair  to  say  that  the  above  algorithm  can  be  understood  as  a  range-limited 
version  of  the  Vrn-cntrd  law.  • 


5.2.3  Correctness  and  complexity  of  geometric-center  laws 

In  this  section  we  characterize  convergence  and  complexity  properties  of  the 
geometric-center  laws.  The  asynchronous  execution  of  the  Voronoi-centroid 
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control  and  communication  law  can  be  studied  as  an  asynchronous  gradient 
dynamical  system;  see  [Cortes  et  al.,  2004]. 

The  following  theorem  summarizes  the  results  known  in  the  literature 
about  the  asymptotic  properties  of  these  laws. 

Theorem  5.5  (Correctness  of  the  geometric-center  algorithms). 

For  d  E  N,  r  E  M>o  and  £  €  M>o,  the  following  statements  hold. 

(i)  on  the  network  Sd,  the  law  CCvrn-cntrd  and  on  the  network  Sve hides; 
the  law  CCvrn-cntrd-dynmcs  both  achieve  the  e-distortion  deployment 
task  ^-distor-dpiy  ■  Moreover,  any  execution  of  CCy rn-cntrd  and  CCvrn-cntrd-dynmcs 
monotonically  optimizes  the  multicenter  function  Tfdistor/ 

(ii)  on  the  network  Sd,  any  execution  of  the  law  CCy rn-crcmcntr  mono¬ 
tonically  optimizes  the  multicenter  function  Tide,' 

(Hi)  on  the  network  Sd,  any  execution  of  the  law  CCy rN-Ncntr  monoton¬ 
ically  optimizes  the  multicenter  function  Hsp; 

(iv)  on  the  network  5ld>  the  law  CClmtd-Vrn-nrml  achieves  the  e-r-area 

deployment  tasA;  7^_r_area_dpiy .  Moreover,  any  execution  o/CClmtd-Vrn-nrml 
monotonically  optimizes  the  multicenter  function  7tare a,p‘  and 

(v)  on  the  network  5ld,  the  law  CClmtd-Vrn-cntrd  achieves  the  e-r- 
distortion-area  deployment  task  ^-r-distor-area-dpiy  ■  Moreover,  any 
execution  of  CClmtd-Vrn-cntrd  monotonically  optimizes  the  multi¬ 
center  function  77distor-area,§  • 

The  proof  of  this  theorem  is  given  in  Section  5.5.1.  The  results  on 
CCvRN_CNTRD  and  CCvrn-cntrd-dynmcs  appeared  originally  in  [Cortes  et  ah, 

2004],  Note  that  an  execution  of  CCvrn-cntrd  can  be  viewed  as  an  alternating 
sequence  of  configuration  of  points  and  partitions  of  the  space  with  the  prop¬ 
erties  that  (i)  each  configuration  of  points  corresponds  to  the  set  of  centroid 
locations  of  the  immediately  preceding  partition  in  the  sequence,  and  (ii) 
each  partition  corresponds  to  the  Voronoi  partition  determined  by  the  im¬ 
mediately  preceding  configuration  of  points  in  the  sequence.  The  monotonic 
behavior  of  Tfdistor  now  follows  from  Propositions  2.14  and  2.15.  Similar 
interpretations  can  be  given  to  all  other  laws.  In  particular,  the  mono¬ 
tonic  behavior  of  Tide  along  executions  of  CCvrn-crcmcntr  can  be  established 
via  Proposition  2.20,  and  the  monotonic  behavior  of  Ti.sp  along  executions 
of  CCvrn-ncntr  can  be  established  via  Proposition  2.22.  Continuous-time 
versions  of  these  laws  are  studied  by  Cortes  and  Bullo  [2005]  via  nonsmooth 
stability  analysis,  where  the  following  convergence  properties  are  established 
(recall  the  notion  of  active  and  passive  nodes  introduced  in  Sections  2.3.2 
and  2.3.3):  all  active  agents  are  guaranteed  to  asymptotically  reach  the  cir- 
cumcenter  (respectively,  incenter)  of  their  Voronoi  region,  whereas  it  is  not 
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known  if  the  same  conclusion  holds  for  the  passive  agents.  Depending  on 
the  polytope  Q,  there  exist  circumcenter  and  incenter  Voronoi  configura¬ 
tions  where  not  all  agents  are  active,  and  simulations  show  that  in  some 
cases  the  continuous-time  versions  of  CCyRN_CRCMCNTR  and  CCvrn-ncntr  con¬ 
verge  to  them.  It  is  an  open  research  question  to  show  that  CCvrn-crcmcntr 
and  CCvRn-ncntr  achieve  the  e-disk-covering  deployment  task  7^_dc-dpiy  and 
the  e-sphere-packing  deployment  task  T-sp-dpiy,  respectively.  Finally,  the 
results  on  CClmtd-Vrn-nrml  and  CClmtd-Vrn-cntrd  appeared  in  [Cortes  et  al., 
2005], 

Next,  we  analyze  the  time  complexity  of  CClmtd-Vrn-cntrd-  We  provide 
complete  results  only  for  the  case  d  =  1  and  uniform  density.  We  assume 
that  diam((5)  is  independent  of  n,  r  and  e. 

Theorem  5.6  (Time  complexity  of  Lmtd-Vrn-cntrd  law).  Assume 
the  robots  evolve  in  a  closed  interval  QcR,  that  is,  d  =  1,  and  assume  that 
the  density  is  uniform,  that  is,  cf>  =  1.  For  r  E  M>o  and  e  E  M>o,  on  the 
network  5ld  TC(7^_r._ciistor-area-dply 7  ^^Lmtd-Vrn-cntrd)  £  0(jl  log(/lc  )). 

The  proof  of  this  result  is  given  in  Section  5.5.2  following  the  treatment 
in  [Martinez  et  al.,  2007b], 

Remark  5.7  (Congestion  effects).  Interestingly,  Theorem  5.6  also  holds 
if,  motivated  by  wireless  congestion  considerations,  we  take  the  communi¬ 
cation  range  r  to  be  a  monotone  non-increasing  function  r  :  N  — ►  ]0,  27r[  of 
the  number  of  robotic  agents  n.  • 

5.3  SIMULATION  RESULTS 

In  this  section,  we  illustrate  the  execution  of  the  various  control  and  com¬ 
munication  laws  introduced  in  this  chapter. 


Geometric-center  algorithms  for  expected-value  optimization 

The  Vrn-cntrd,  Lmtd-Vrn-nrml,  and  Lmtd-Vrn-cntrd  control  and 
communication  laws  are  implemented  in  Mathematica®  as  a  library  of  rou¬ 
tines  and  a  main  program  running  the  simulation. 

The  objective  of  a  first  routine  is  to  compute  the  [[-limited  Voronoi  par¬ 
tition  and  parameterize  each  cell  V). r ,  i  e  {1, . . .  ,n}  in  polar  coordinates. 
The  objective  of  a  second  routine  is  to  compute  the  surface  integrals  on  these 
sets  and  the  line  integrals  on  their  boundaries  via  the  numerical  integration 
routine  NIntegrate.  We  pay  careful  attention  to  numerical  accuracy  issues 
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in  the  computation  of  the  Voronoi  diagram  and  in  the  integration. 

Measuring  displacements  in  meters,  we  consider  the  polygon  Q  determined 
by  the  vertices 

{(0, 0),  (2.125, 0),  (2.9325, 1.5),  (2.975, 1.6), 

(2.9325, 1.7),  (2.295, 2.1),  (0.85,  2.3),  (0.17, 1.2)}. 

The  diameter  of  Q  is  diam(Q)  =  3.37796.  In  all  figures,  the  density  func¬ 
tion  4>  is  the  sum  of  four  Gaussian  functions  of  the  form  11  exp(6(— (x  — 
^center)2  —  (y  —  2/center)2))  and  is  represented  by  means  of  its  contour  plot. 
Darker  blue  areas  correspond  to  higher  values  of  the  density  function.  The 
centers  (xcenter)  2/center)  of  the  Gaussian  functions  are  given  by  (2.15,  .75), 
( 1 . ,  .25) ,  (.725,1.75)  and  (.25,  .7),  respectively.  The  area  of  the  polygon  is 
A  0(Q)  =  17.6352. 

We  show  evolutions  of  (5d,  Vrn-cntrd)  and  (5d,  Vrn-cntrd-dynmcs) 
in  Figures  5.2  and  5.3,  respectively.  One  can  verify  that  the  final  network 
configurations  are  centroidal  Voronoi  configuration.  In  other  words,  the  task 
7^-distor-dpiy  is  achieved,  as  guaranteed  by  Theorem  5.5(i). 


Figure  5.2  Evolution  of  (Sd,  Vrn-cntrd)  with  n  =  20  robots.  The  left  (respectively, 
right)  figure  illustrates  the  initial  (respectively,  final)  locations  and  Voronoi 
partition.  The  central  figure  illustrates  the  evolution  of  the  robots.  After 
13  seconds,  the  value  of  the  multicenter  function  Tfdistor  has  monotonically 
increased  to  approximately  —.515. 


We  show  an  evolution  of  (5ld,  Lmtd-Vrn-nrml)  in  Figure  5.4.  One 
can  verify  that  the  final  network  configuration  is  a  ((-limited  area-centered 
Voronoi  configuration.  In  other  words,  the  task  X_r_area-dpiy  is  achieved,  as 
guaranteed  by  Theorem  5.5(h). 

We  show  an  evolution  of  (5ld,  Lmtd-Vrn-cntrd)  in  Figure  5.5.  One  can 
verify  that  the  final  network  configuration  is  a  ((-limited  centroidal  Voronoi 
configuration.  In  other  words,  the  task  T(-r-distor-area-dpiy  is  achieved,  as 
guaranteed  by  Theorem  5.5(iii). 

We  show  an  evolution  of  (5ld5  Rng-Vrn-cntrd)  in  Figure  5.6.  One  can 
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Figure  5.3  Evolution  of  (SD,  Vrn-CNTRD-dynmcs)  with  n  =  20  robots.  The  feedback  gain 
is  fcprop  =  3.5.  The  left  (respectively,  right)  figure  illustrates  the  initial  (re¬ 
spectively,  final)  locations  and  Voronoi  partition.  The  central  figure  illustrates 
the  evolution  of  the  robots.  After  20  seconds,  the  value  of  the  multicenter 
function  77distor  has  monotonically  increased  to  approximately  —.555. 


Figure  5.4  Evolution  of  (5ld,  Lmtd-Vrn-nrml)  with  n  =  20  robots  and  r  =  0.4.  The 
left  (respectively,  right)  figure  illustrates  the  initial  (respectively,  final)  loca¬ 
tions  and  Voronoi  partition.  The  central  figure  illustrates  the  evolution  of  the 
robots.  The  ^-limited  Voronoi  cell  of  each  robot  is  plotted  in  light  gray.  Af¬ 
ter  36  seconds,  the  value  of  the  multicenter  function  Tfarea, r  is  approximately 
14.141. 


Figure  5.5  Evolution  of  (5ld,  Lmtd-Vrn-cntrd)  with  n  =  20  robots  and  r  =  0.4.  The 
left  (respectively,  right)  figure  illustrates  the  initial  (respectively,  final)  loca¬ 
tions  and  Voronoi  partition.  The  central  figure  illustrates  the  evolution  of  the 
robots.  The  |-limited  Voronoi  cell  of  each  robot  is  plotted  in  light  gray.  After 
90  seconds,  the  value  of  the  multicenter  function  77distor-area,  s  is  approximately 
-.386. 
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verify  that  the  final  network  configuration  corresponds  to  a  critical  point 
of  the  mixed  distortion- area  multicenter  function  Hdistor-area,|,ft>  with  b  = 
—  diam(Q)2,  see  Exercise  E5.3. 


Figure  5.6  Evolution  of  (Sld,  Rng-Vrn-CNTRd)  with  n  =  20  robots  and  r  =  0.47.  The 
left  (respectively,  right)  figure  illustrates  the  initial  (respectively,  final)  loca¬ 
tions  and  Voronoi  partition.  The  central  figure  illustrates  the  evolution  of 
the  robots.  The  |-limited  Voronoi  cell  of  each  robot  is  plotted  in  light  gray. 
After  13  seconds,  the  value  of  the  multicenter  function  77distor-area,  j  ,t,  with 
b  =  —  diam(Q)2,  is  approximately  —4.794. 


As  discussed  in  Remark  5.4,  the  Rng-Vrn-cntrd  can  be  understood  as 
a  range-limited  implementation  of  Vrn-cntrd  in  a  network  of  robots  with 
range-limited  interactions.  Let  us  briefly  compare  the  evolutions  depicted 
in  Figures  5.2  and  5.6.  According  to  Proposition  2.18,  we  compute 


P 


r 

2 

diamQ 


0.06957. 


From  the  constant-factor  approximation  (2.3.7),  the  absolute  error  is  guar¬ 
anteed  to  be  less  than  or  equal  to  (/32  —  l)77distor-area,j,&(-Pfinai)  ~  4.77,  where 
-Pfinai  denotes  the  final  configuration  in  Figure  5.6.  The  percentage  error  in 
the  value  of  the  multicenter  function  Tfdistor  between  the  final  configuration 
of  the  evolution  in  Figure  5.2  and  the  final  configuration  of  the  evolution  in 
Figure  5.6  is  approximately  equal  to  3.277%.  As  expected,  one  can  verify  in 
simulations  that  the  percentage  error  of  the  performance  of  the  range-limited 
implementation  improves  with  higher  values  of  the  ratio  ■ 


Geometric-center  algorithms  for  disk-covering  and  sphere-packing 


The  Vrn-CRCMCNTR  and  Vrn-ncntr  control  and  communication  laws  are 
implemented  in  Mathematica®  as  a  single  centralized  program  running  the 
simulation.  We  compute  the  bounded  Voronoi  diagram  of  a  collection  of 
points  using  the  package  ComputationalGeometry.  We  compute  the  cir- 
cumcenter  of  a  polygon  via  the  algorithm  in  [Skyum,  1991]  and  the  incenter 
set  via  the  LinearProgramming  solver  in  Mathematica. 

Measuring  displacements  in  meters,  we  consider  the  polygon  determined 
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by  the  vertices 

{(0,  0),  (2.5, 0),  (3.45, 1.5),  (3.5, 1.6),  (3.45, 1.7),  (2.7,  2.1),  (1.,  2.4),  (.2, 1.2)}. 

We  show  an  evolution  of  (5d,  Vrn-CRCMCNTr)  in  Figure  5.7.  One  can 
verify  that  at  the  final  configuration  all  robots  are  at  the  circumcenter  of 
their  own  Voronoi  cell.  In  other  words,  the  task  7^_dc-dpiy  is  achieved  by  this 
evolution.  As  stated  in  Section  5.2.3,  it  is  an  open  research  question  to  show 
that  this  fact  holds  in  general  for  CCvrn_cromcntr.  Cortes  and  Bullo  [2005] 
prove  a  similar  result  for  a  continuous-time  implementation  of  this  law. 


Figure  5.7  Evolution  of  (Sd,  Vrn-CRCMCNTr)  with  n  =  16  robots.  The  left  (respectively, 
right)  figure  illustrates  the  initial  (respectively,  final)  locations  and  Voronoi 
partition.  The  central  figure  illustrates  the  evolution  of  the  robots.  After  20 
seconds,  the  value  of  the  multicenter  function  Tide  has  monotonically  decreased 
to  approximately  0.43273  meters. 


We  show  an  evolution  of  (<Sd,  Vrn-ncntr)  in  Figure  5.8.  One  can  verify 
that  at  the  final  configuration  all  robots  are  at  the  incenter  of  their  own 
Voronoi  cell.  In  other  words,  the  task  7^_sp_dpiy  is  achieved  by  this  evolution. 
As  stated  in  Section  5.2.3,  it  is  an  open  research  question  to  show  that  this 
fact  holds  in  general  for  CCyRN_NCNTR.  Cortes  and  Bullo  [2005]  prove  a  similar 
result  for  a  continuous-time  implementation  of  this  law. 


Figure  5.8  Evolution  of  (5d,  Vrn-ncntr)  with  n  =  16  robots.  The  left  (respectively, 
right)  figure  illustrates  the  initial  (respectively,  final)  locations  and  Voronoi 
partition.  The  central  figure  illustrates  the  evolution  of  the  robots.  After  20 
seconds,  the  value  of  the  multicenter  function  Ttsp  has  monotonically  increased 
to  approximately  0.2498  meters. 
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5.4  NOTES 

The  deployment  problem  studied  in  this  chapter  is  related  to  the  literature 
on  facility  location  [Drezner,  1995,  Okabe  et  ah,  2000,  Du  et  ah,  1999]  and 
geometric  optimization  [Agarwal  and  Sharir,  1998,  Boltyanski  et  ah,  1999], 
see  also  Section  2.4.  These  disciplines  study  spatial  resource  allocation  prob¬ 
lems  and  play  an  important  role  in  quantization  theory,  mesh  and  grid  op¬ 
timization  methods,  clustering  analysis,  data  compression,  and  statistical 
pattern  recognition. 

Dispersion  laws  have  been  traditionally  studied  in  behavior  control,  see 
e.g.,  [Arkin,  1998,  Schultz  and  Parker,  2002,  Balch  and  Parker,  2002].  De¬ 
ployment  algorithms  that  make  use  of  potential  field  methods  are  pro¬ 
posed  by  Payton  et  ah  [2001],  Howard  et  ah  [2002].  Other  works  include 
[Bulusu  et  ah,  2001]  on  adaptive  beacon  placement  for  localization,  [Poduri  and  Sukhatme, 
2004]  on  network  deployments  that  satisfy  a  pre-specified  constraint  in  the 
number  of  neighbors  of  each  robot,  [Arsie  and  Frazzoli,  2007]  on  deployment 
strategies  that  minimize  the  expected  time  needed  for  any  robot  to  service  a 
newly-appeared  target  point  in  the  environment,  and  [Hussein  and  Stipanovic, 

2007]  on  dynamically  surveying  a  known  environment. 

Deployment  algorithms  for  coverage  control  are  a  subject  of  active  re¬ 
search.  Among  the  most  recent  works,  Martinez  [2007],  Schwager  et  ah 
[2008b]  consider  coverage  problems  where  the  density  function  is  unknown,  Lekien  and  Leonard 
[2007]  propose  centralized  laws  for  non-uniform  coverage  using  cartograms,  de  Silva  and  Christ 
[2007]  study  static  coverage  problems  with  minimal  assumptions  on  the  ca¬ 
pabilities  of  individual  sensors  using  algebraic  topology,  Kwok  and  Martinez 
[2008]  propose  distributed  deployment  strategies  for  energy-constrained  net¬ 
works,  Laventall  and  Cortes  [2008]  design  distributed  algorithms  for  net¬ 
works  of  robots  whose  sensors  have  range-limited  wedge-shaped  footprints,  Gao  et  ah 
[2008]  consider  discrete  coverage  problems,  Schwager  et  ah  [2008a]  consider 
combined  exploration  and  deployment  problems,  and  Zhong  and  Cassandras 
[2008],  Pimenta  et  ah  [2008],  Caicedo-Nuhez  and  Zefran  [2008]  deal  with 
centroidal  Voronoi  tessellations  in  nonconvex  environments.  Graham  and  Cortes 
[2008]  study  the  optimality  of  circumcenter  and  incenter  Voronoi  configu¬ 
rations  for  the  estimation  of  stochastic  spatial  fields.  Susca  et  ah  [2007] 
consider  some  planar  interpolation  problems. 

Deployment  problems  play  a  relevant  role  in  other  coordination  tasks, 
such  as  surveillance,  search  and  rescue,  and  exploration  and  map  building 
of  unknown  environments.  Choset  [2001]  considers  sweep  coverage  problems, 
where  one  or  more  robots  equipped  with  limited  footprint  sensors  have  to 
visit  all  points  in  the  environment.  In  [Simmons  et  ah,  2000],  deployment 
locations  for  a  network  of  heterogeneous  robots  are  user-specified  after  an 
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initial  map  of  the  unknown  environment  has  been  built.  Gupta  et  al.  [2006a] 
consider  a  combined  sensor  coverage  and  selection  problem. 

Deployment  of  robotic  agents  with  visibility  sensors  has  been  studied  un¬ 
der  a  variety  of  assumptions.  When  the  environment  is  known  a  priori,  the 
problem  can  be  cast  as  the  classical  Art  Gallery  Problem  [Chvatal,  1975] 
from  computational  geometry,  where  one  is  interested  in  achieving  complete 
visibility  with  the  minimum  number  of  agents  possible.  The  Art  Gallery 
Problem  is  computationally  hard  [Lee  and  Lin,  1986,  Eidenbenz  et  ah,  2001] 
and  the  best  known  approximation  algorithms  yield  solutions  within  a  loga¬ 
rithmic  factor  of  the  optimum  number  of  agents  [Ghosh,  1987,  Efrat  and  Har-Peled, 
2006].  Hernandez-Penalver  [1994],  Pinciu  [2003]  study  the  problem  of  achiev¬ 
ing  full  visibility  while  guaranteeing  that  the  final  network  configuration 
will  have  a  connected  visibility  graph.  Recent  works  on  multi-robot  ex¬ 
ploration  of  unknown  environments  include  [Batalin  and  Sukhatme,  2004, 
Burgard  et  al.,  2005,  Howard  et  al.,  2006].  Topological  exploration  of  graph¬ 
like  environments  by  single  and  multiple  robots  is  studied  in  [Rekleitis  et  al., 

2001,  Dynia  et  al.,  2006,  Fraigniaud  et  al.,  2004].  A  simple  one-step  strategy 
for  visibility  deployment,  without  the  need  for  synchronization,  achieving  the 
worst-case  optimal  bounds  in  terms  of  number  of  robots  required,  and  under 
limited  communication  is  presented  in  [Ganguli  et  al.,  2007a]. 


5.5  PROOFS 

This  section  gathers  the  proofs  of  the  main  results  presented  in  the  chapter. 


5.5.1  Proof  of  Theorem  5.5 

Proof.  Let  Po  =  (p^(0), . . .  ,p^(0))  £  Qn  denote  the  initial  condition.  The 
proof  strategy  for  all  five  facts  is  similar.  We  first  establish  the  monotonic 
behavior  of  the  corresponding  multicenter  function  along  the  executions  of 
the  control  and  communication  law.  Then,  we  use  the  LaSalle  Invariance 
Principle  to  determine  the  sets  of  asymptotic  convergence  of  the  executions. 
Finally,  we  characterize  these  sets  using  geometric  properties  of  the  multi¬ 
center  functions. 

Fact  (i).  Let  us  start  by  showing  that  executions  of  CCvrn-cntrd  mono- 
tonically  optimize  the  function  Tfdistor-  For  convenience,  we  denote  by 
/v rn-cntrd  :  Qn  — *  Qn  the  map  induced  by  the  execution  of  one  step  of 
the  law  CCvrn-cntrd-  Using  the  extension  of  the  multicenter  function  defined 
over  set  of  points  and  partitions  of  Q,  we  deduce  from  Proposition  2.14  that, 
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for  P  G  Qn, 

"Rdistor  (/vrn-cntrd  (P))  =  -Hdistcr(/V  RN-CNTRD  (P),V(fv  RN-CNTRD  (O) 

P  T^distor  (/vrn-cntrd  (P)MP))- 

The  application  of  Proposition  2.15  yields 

T^distor  (/vrn-cntrd  (P),V(P))>  ndistor(P,V(P)), 

and  hence  Wdistor  (/vrn-cntrd  (-0)  >  Tfdistor (P)-  Since  all  Voronoi  cells  have 
non- vanishing  areas,  the  inequality  is  strict  unless  /vrN-cntrd  (■ P )  =  P- 

The  result  on  the  monotonic  evolution  of  Tfdistor  along  the  executions  of 
CCvrn-cntrd-dynmcs  can  be  established  in  a  similar  way  making  use  of  Propo¬ 
sition  2.14  and  Exercise  E2.5.  The  key  property  of  Vrn-cntrd-DYNMCS 
(see  Remark  5.1)  is  that,  when  run  over  a  finite  time  interval  [t,t  +  1],  the 
control  function  ctl  guarantees  that,  for  each  robot  i  6  {l,...,n}, 

II PW(*  +  1)  -  CM0(E)||2  <  \\p®(t)  -  CM0(V)||2, 

with  strict  inequality  unless  pM(t)  =  CM^(T). 

The  convergence  result  for  both  laws  is  established  in  the  same  way  using 
the  LaSalle  Invariance  Principle  stated  in  Theorem  1.16.  Here,  we  state  the 
proof  for  Vrn-cntrd.  Since  the  set  Qn  is  compact  and  — Hdistor  is  non¬ 
increasing  along  /v rn-cntrd,  we  deduce  that  the  execution  of  CC vrn-cntrd 
starting  from  Po  tends  to  the  largest  positively  invariant  set  M  contained  in 

{P€Qn  |  Wdistor (/V  RN-CNTRD  (P))  =  Wdistor  (P)}- 

The  set  M  is  precisely  the  set  of  centroidal  Voronoi  configurations.  This  is 
a  consequence  of  the  fact  that  Wdistor (/vrn-cntrd (P))  =  Wdistor (P)  implies 
/vrn-cntrd  (P)  =  P,  i.e.,  P  is  a  centroidal  Voronoi  configuration. 

Facts  (ii)  and  (Hi).  The  proof  of  these  facts  runs  parallely  to  the  proof 
of  fact  (i).  Propositions  2.20  and  2.22  are  key  in  establishing  the  monotonic 
evolution  of  Tide  and  Ti.sp,  respectively. 

Fact  (iv).  For  convenience,  we  denote  by  /lmtd-Vrn-nrml  :  Qn  — >  Qn 
the  map  induced  by  the  execution  of  one  step  of  the  law  CClmtd-Vrn-nrml- 
Let  us  show  that  executions  of  CClmtd-Vrn-nrml  monotonically  optimize  the 
function  Harea,§-  Using  the  extension  of  the  multicenter  function  defined 
over  set  of  points  and  partitions  of  Q,  we  deduce  from  Proposition  2.14 
that,  for  P  S  Qn, 

Ftarea,  |  (/lmtd-Vrn-nrml {P )  )  =  ^area,r  (/lmtd-Vrn-nrml  {P)i  U(/lmtd-Vrn-nrml(R))) 

P  77area,r  (/lmtd-Vrn-nrml  (P),V(P))- 
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The  line  search  procedure  for  each  robot  embedded  in  the  definition  of  the 
control  function  of  CClmtd-Vrn-nrml  ensures  that 

Lfarea, |  (/lmtd-Vrn-nrml  (-P) ,  F(-P))  —  77area,  ^  (P,V(P)), 

and  hence  (/lmtd-Vrn-nrml(-P))  >  7iSLTea.,r(P).  Note  that  the  inequal¬ 

ity  is  strict  unless  /lmtd-Vrn-nrml(-P)  =  P-  Tbe  application  of  the  LaSalle 
Invariance  Principle  as  in  the  proof  of  fact  (i)  leads  us  now  to  the  result. 

Fact  (v).  The  proof  of  this  facts  runs  parallely  to  the  proof  of  facts  (i) 
and  (iv).  Propositions  2.14  and  2.16  are  key  in  establishing  the  monotonic 
evolution  of  77distor-area,r  ■  The  convergence  result  is  established  using  the 
LaSalle  Invariance  Principle.  ■ 


5.5.2  Proof  of  Theorem  5.6 

Proof.  For  d  =  1,  Q  is  a  compact  interval  on  M,  say  Q  =  [</_,  q+\.  We  start 
with  a  brief  discussion  about  connectivity.  In  the  r-limited  Delaunay  graph, 
two  agents  that  are  at  most  at  a  distance  r  from  each  other  are  neighbors  if 
and  only  if  there  are  no  other  agents  between  them.  Additionally,  we  claim 
that,  if  agents  i  and  j  are  neighbors,  then  |  CM^(pW)  —  CM^(y[Jl)|  <  r, 
where  V®  denotes  the  set  defined  by  the  control  function  ctl  when  evaluated 
by  agent  i.  To  show  this  fact,  assume  without  loss  of  generality  that  pH  < 
pti  1 .  Let  us  consider  the  case  where  the  agents  have  neighbors  on  both  sides 
(the  other  cases  can  be  treated  analogously).  Let  p^  (respectively,  p+) 
denote  the  position  of  the  neighbor  of  agent  i  to  the  left  (respectively,  of 
agent  j  to  the  right).  Now, 

CM4V®)  =  ^(p®  +  2  pN+pb']), 

CM*(V0)  =  i(pN  +  2  pW  +p\  1), 

where  we  have  used  the  fact  that  cj)  =  1.  Therefore, 

|  CM0(rW)  -  CM^(Vlj])\  <  i(| -pW |  +  2|pW -PW\  +  \pW-pW\)  <  r. 

This  implies  that  agents  i  and  j  belong  to  the  same  connected  component 
of  the  r-limited  Delaunay  graph  at  the  next  time  step. 

Next,  let  us  consider  the  case  when  ^ld(^)  is  connected  at  the  initial 
network  configuration  Po  =  (p^(0), . . .  ,ptnl(0)).  Without  loss  of  generality, 
assume  that  the  agents  are  ordered  from  left  to  right  according  to  their 
unique  identifier,  that  is,  pM(0)  <  ■  ■  ■  <  p^n  1(0).  We  distinguish  three  cases 
depending  on  the  proximity  of  the  leftmost  and  rightmost  agents  1  and  n, 
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respectively,  to  the  boundary  of  the  environment:  case  (a)  both  agents  are 
within  a  distance  5  of  dQ;  case  (b)  none  of  the  two  is  within  a  distance  | 
of  dQ;  and  case  (c)  only  one  of  the  agents  is  within  a  distance  |  of  dQ. 
Here  is  an  important  observation:  from  one  time  instant  to  the  next  one, 
the  network  configuration  can  fall  into  any  of  the  cases  described  above. 
However,  because  of  the  discussion  on  connectivity,  transitions  can  only 
occur  from  case  (b)  to  either  case  (a)  or  (c);  and  from  case  (c)  to  case 
(a).  As  we  show  below,  for  each  of  these  cases,  the  network  evolution  under 
CCvrn-cntrd  can  be  described  as  a  discrete-time  linear  dynamical  system 
which  respects  agents’  ordering. 

Let  us  consider  case  (a).  In  this  case,  we  have 

P[1](£+l)  =  \(P[1](l)+P[2](l))  +  \q-, 

p ^  {£  +  1)  =  ^  ( p ^  (£)  +  2 p^  {£)  +  p^  (£)), 
p^n~1\£+  1)  =  ^(p[n~2](£)  +  2p^n~1\£)  +  p[n](£)), 

pW(£+l)  =  ±(pl”-i](£)+pW(£))  +  ±q+. 

Equivalently,  we  can  write  P(£  +  1)  =  ■  P(£)  +  6(a),  where  the  matrix 

A(a)  G  Mnxn  and  the  vector  6(a)  G  Mn  are  given  by 


-1 

I 

1 

f 

0 

1 

O' 

0 

0 

h- 

A(a)  = 

4 

0 

1 

4 

1 

2 

1 

4 

’  k(a)  - 

0 

0 

0 

1 

1 

1 

0 

4 

0 

! 

4 

1 

I^JV-hI'nF 

_h+. 

Note  that  the  only  equilibrium  network  configuration  P*  respecting  the  or¬ 
dering  of  the  agents  is  given  by 

P*]  =  Q-  +  -^(1  +  2(*  -  1  ))(<?+  ~q~),  i  G  {1, . . .  ,n}, 

and  note  that  this  is  a  |-centroidal  Voronoi  configuration  (under  the  as¬ 
sumption  of  case  (a)).  We  can  therefore  write  (P(£+ 1)  —  P*)  =  A^(P(£)  — 
P*).  Now,  note  that  A(a)  =  ATrid”  Q,^).  Theorem  1.77(h)  implies  that 
lim^+00  {P(£)~P*)  =0n,  and  that  the  maximum  time  required  for  \\P(£)  — 
P*||2  <  e||Po  —  P*\h  (over  all  initial  conditions  in  Mn)  is  @(n2loge_1).  It  is 
not  obvious,  but  it  can  be  verified,  that  the  initial  condition  providing  the 
lower  bound  in  the  time  complexity  estimate  does  indeed  have  the  property 

239 


“Distributed  Control  of  Robotic  Networks”  by  F.  Bullo,  J.  Cortes  and  S.  Martinez 
Copyright  ©  2006-2008.  Manuscript  under  contract.  This  version:  October  27,  2008 


DCRN  October  27,  2008 


of  respecting  the  agents’  ordering;  this  fact  holds  for  all  three  cases  (a),  (b) 
and  (c). 


The  case  (b)  can  be  treated  in  the  same  way.  The  network  evolution  takes 
now  the  form  P{1  +  1)  =  A(b)  ■  P(£)  +  6(b) ,  where  the  matrix  A(b)  E  Rnxn 
and  the  vector  6(b)  £  K"  are  given  by 


■3 

1 

1 

f 

0 

1 

O' 

0 

0 

-\r 

A(b)  = 

4 

0 

1 

4 

? 

2 

1 

4 

>  6(b)  - 

0 

0 

_0 

1 

1 

1 

0 

4 

0 

! 

4 

1 

I  r 

L  a'  J 

In  this  case,  a  (non-unique)  equilibrium  network  configuration  respecting 
the  ordering  of  the  agents  is  of  the  form 

[i]  ■  1  +  n  .  n  i 

Pi  =  ir - - — r,  i  £  {1, . . . ,  nj. 

Note  that  this  is  a  |-centroidal  Voronoi  configuration  (under  the  assumption 
of  case  (b)).  We  can  therefore  write  (P(£+l)  — P*)  =  A^(P(£)  —  P*).  Now, 
note  that  A(b)  =  ATrid+  (|,  §).  We  compute  Pave  =  il£(P0-P*)  =  H  ^P0. 
With  this  calculation,  Theorem  1.77(i)  implies  that  lim^_>+0O  (P(£)  -P*~ 
Paveln)  =  0 n,  and  that  the  maximum  time  required  for  || P{£)  —  P*  — 
Pavel?t || 2  <  e||Po  —  P*  —  Pave  1  n \ | 2  (over  all  initial  conditions  in  Mn)  is 
0(n2  logs-1) . 


Case  (c)  needs  to  be  handled  differently.  Without  loss  of  generality,  as¬ 
sume  that  agent  1  is  within  distance  |  of  dQ  and  agent  n  is  not  (the  other 
case  is  treated  analogously).  Then,  the  network  evolution  takes  now  the 
form  P(£  +  1)  =  A(c)  •  P{£)  +  6(cj ,  where  the  matrix  A(c)  £  M.nxn  and  the 
vector  6(c)  £  Kn  are  given  by 


^4(c) 


■i  l 

I  I 

4  ? 

0  I 


0 

0 


0 

1 

f  1 

2  4 


1 

4 

0 


1 

! 


n „  n 

2  q~ 

0 

’  b(c) 

0 

i„ 

L  4r  J 

Note  that  the  only  equilibrium  network  configuration  P*  respecting  the  or¬ 
dering  of  the  agents  is  given  by 

p£']  =q_+l-(2i-  l)r,  *£{!,..., n}, 
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and  note  that  this  is  a  |-centroidal  Voronoi  configuration  (under  the  as¬ 
sumption  of  case  (c)).  In  order  to  analyze  -A(c),  we  recast  the  n-dimensional 
discrete-time  dynamical  system  as  a  2n-dimensional  one.  To  do  this,  we 
define  a  2n-dimensional  vector  y  by 

y[*l  =  p^l\  i  £  {1, . . . ,  n},  and  yln+*]  =  p^n~l+l\  %  £  {1, . . . ,  n}.  (5.5.1) 

Now,  one  can  see  that  the  network  evolution  can  be  alternatively  described 
in  the  variables  (yM, . . .  ,  yt2nl)  as  a  linear  dynamical  system  determined  by 
the  2 n  x  2 n  matrix  ATrid^~n(|,  |).  Using  Theorem  1.77(h),  and  exploiting 
the  chain  of  equalities  (5.5.1),  we  can  infer  that,  in  case  (c),  the  maximum 
time  required  for  || P{t)  —  P*\\2  —  ell^o  —  P* II 2  (over  all  initial  conditions  in 
Mn)  is  ©(rrloge^1). 

In  summary,  for  all  three  cases  (a),  (b)  and  (c),  our  calculations  show  that, 
in  time  0(n2  log  e_1),  the  error  2-norm  satisfies  the  contraction  inequality 
|| P{tj  —  P*\\2  <  e||-Po  —  P*\\2-  We  convert  this  inequality  on  2-norms  into 
an  appropriate  inequality  on  oo-norms  as  follows.  Note  that  \\Po  —  P*  ||oo  = 
maxjgr1)  n\  |pW(0)  —  p$\  <  (q+  —  q-).  For  i  of  order  n2logy_1,  we  have 

I  P{t)  -  p*  loc  <  \\P(£)  -  p*h  <  v\\Po  -  P*h 

<  r]y/n\\Po  -  P*||oo  <  vVn(q+  -  q~ )• 

This  means  that  e-r-deployment  is  achieved  for  r/\/n(q+  —  c/_)  =  e,  that  is, 
in  time  0(n2  \ogrj~1)  =  0(n2  log(ne-1)). 

Up  to  here  we  have  proved  that,  if  the  graph  ^ld(^)  is  connected  at 
P0,  then  TC(7^_r_dply,CCvaN_CNTRD,  Pq)  €  0(n2  log(ne-1)).  If  GhD(r)  is  not 
connected  at  Pq,  note  that  along  the  network  evolution  there  can  only  be 
a  finite  number  of  time  instants,  at  most  n  —  1  where  a  merging  of  two 
connected  components  occurs.  Therefore,  the  time  complexity  is  at  most 
0(n3  log(ne-1)),  as  claimed.  ■ 

5.6  EXERCISES 

E5.1  (Monotonic  evolution  of  Tide  and  7tsp).  Prove  statements  (ii)  and  (iii)  in 
Theorem  5.5. 

Hint:  Make  use  of  the  optimality  of  the  Voronoi  partition  and  of  center  locations 
stated  in  Propositions  2.20  and  2.22. 

E5.2  (Correctness  of  Lmtd-Vrn-cntrd).  Prove  Theorem  5.5(v). 

Hint:  To  establish  the  monotonic  evolution  of  the  multicenter  function,  make  use 
of  the  optimality  of  the  Voronoi  partition  stated  in  Proposition  2.14  and  of  centroid 
locations  stated  in  Proposition  2.16.  To  establish  the  convergence  result,  make  use 
of  the  LaSalle  Invariance  Principle  stated  in  Theorem  1.16. 

E5.3  (Correctness  of  Rng-Vrn-cntrd).  Mimic  the  proof  of  Theorem  5.5(iv)  to 
show  that  the  evolutions  of  Rng-Vrn-cntrd  monotonically  optimize  the  mixed 
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distortion-area  multicenter  function 

Tf  distor-area,  ^  ,6,  with  b  —  (1  i  d.  I  [I  f  0  )  , 

and  asymptotically  approach  its  set  of  critical  points. 

E5.4  (The  “n-bugs  problem”  and  cyclic  interactions:  continued).  Consider  n 
robots  at  counterclockwise-ordered  positions  61, ... ,6n.  First,  consider  the  cyclic 
balancing  system  described  in  Exercise  El. 29  with  parameter  k  =  1/4  and  given 
by 

9i{£+  1)  =  (.  G  Z>0. 

Second,  consider  the  Voronoi-centroid  law  on  the  circle  (with  uniform  density)  in 
which  each  robots  computes  its  Voronoi  partition  of  the  circle  (see  Figure  2.6)  and 
the  each  robots  moves  to  midpoint  of  its  arc.  Show  that  the  two  behaviors  are 
identical. 
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Chapter  Six 

Boundary  estimation  and  tracking 


The  aim  of  this  chapter  is  to  provide  an  example  of  a  motion  coordination 
algorithm  that  can  be  used  in  a  specific  sensing  task.  This  is  the  task 
of  detection  and  estimation  of  an  evolving  boundary  in  2D  by  a  robotic 
sensor  network.  This  type  of  operation  can  be  of  interest  in  the  validation 
of  oceanographic  and  atmospheric  models  as  well  as  for  the  demarcation  of 
hazardous  environments.  In  the  ocean,  a  boundary  can  delimit  areas  where 
there  are  abrupt  changes  in  temperatures,  which  can  influence  the  marine 
biodiversity  in  those  areas.  In  the  atmosphere,  a  boundary  can  establish 
the  front  of  a  highly-polluting  expanding  substance.  The  containment  of  an 
spreading  fire  is  another  situation  that  can  translate  into  the  specific  task 
of  boundary  estimation  and  tracking. 

Under  full  knowledge  and  centralized  computation,  various  methods  exist 
in  the  literature  to  solve  the  boundary  estimation  task.  A  first  challenge 
that  we  face  when  designing  coordination  algorithms  for  robotic  networks  is 
the  determination  of  the  extent  to  which  these  tasks  can  be  performed  in  a 
distributed  way  and  under  limited  information.  The  algorithm  presented  in 
this  chapter  to  track  environmental  boundaries  is  distributed,  in  the  sense 
that  it  does  not  require  the  use  of  a  central  station  or  “fusion  center.”  Our 
algorithm  builds  on  basic  notions  from  interpolation  theory  and  employs 
distributed  linear  iterations  and  consensus  algorithms.  The  algorithm  can  be 
seen  as  part  of  a  general  effort  to  investigate  distributed  filters  for  estimation 
tasks. 

A  second  challenge  is  posed  by  sudden  events  that  may  occur  when  per¬ 
forming  sensing  tasks,  such  as  the  detection  of  an  intruder  or  an  abrupt 
change  in  the  concentration  of  some  chemical.  These  events  require  a  spe¬ 
cific  action  on  the  part  of  the  network.  Since  the  timing  of  such  events  is 
not  known  a  priori,  this  requires  coordination  algorithms  that  specify  event- 
driven,  asynchronous  responses  of  the  robotic  network.  We  deal  with  this 
issue  building  on  the  robotic  network  model  proposed  in  Chapter  3.  Our 
exposition  here  on  boundary  estimation  relies  on  [Susca  et  al.,  2008,  Susca, 
2007], 
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The  chapter  is  organized  as  follows.  The  first  section  of  the  chapter  ex¬ 
tends  the  synchronous  model  proposed  in  Chapter  3  to  include  the  event- 
driven,  asynchronous  operation  of  a  robotic  network.  The  second  section 
reviews  some  basic  facts  on  interpolation  theory  for  boundary.  In  the  third 
section,  we  introduce  the  Estimate  Update  and  Balancing  Law  to  solve 
the  boundary  estimation  task  and  analyzes  its  correctness.  We  end  the  chap¬ 
ter  with  three  sections  on,  respectively,  bibliographical  notes,  proofs  of  the 
results  presented  in  the  chapter,  and  exercises.  Throughout  the  exposition, 
we  make  extensive  use  of  polygonal  approximations,  geometric  decomposi¬ 
tions,  and  consensus  algorithms.  The  convergence  analysis  is  based  on  the 
LaSalle  Invariance  Principle  and  on  distributed  linear  iterations. 


6.1  EVENT-DRIVEN  ASYNCHRONOUS  ROBOTIC  NETWORKS 

In  what  follows  we  model  “event-driven  asynchronous”  robotic  networks. 
This  model  describes  groups  of  agents  that  reset  their  processor  states  upon 
certain  asynchronous  events,  i.e. ,  events  that  do  not  necessarily  happen  si¬ 
multaneously  for  all  agents.  For  example,  a  relevant  event  might  be  a  robot 
reaching  a  location  or  leaving  a  region.  The  following  event-driven  model  is 
convenient  to  describe  our  algorithm  for  boundary  estimation  but  its  appli¬ 
cability  extends  beyond  this  particular  scenario.  Following  our  discussion  of 
synchronous  robotic  networks  in  Section  3.1,  the  event-driven  model  consists 
of  the  following  ingredients:  a  robotic  network,  as  in  Definition  3.2,  and  an 
event-driven  control  and  communication  law,  as  defined  next. 

Definition  6.1  (Event-driven  control  and  communication  law).  An 

event-driven  control  and  communication  law  £CC  for  a  robotic  network  S 
consists  of  the  following  sets: 

(i)  A,  a  set  containing  the  null  element,  called  the  communication 
alphabet ;  elements  of  A  are  called  messages', 

(ii)  VPW,  i  G  1,  called  the  processor  state  sets ; 

(hi)  W%]  C  W*  1,  i  6  I,  sets  of  allowable  initial  values ; 

and  the  following  maps: 

(i)  (msg-trigW, msg-genW, msg-recW),  i  £  I,  called  the  message-trigger 
function,  message-generation  function,  and  message-reception  func¬ 
tion,  respectively,  such  that 


(a) 

msg-trigW 

:  X® 

X  W® 

— > {true 

, false} 

(b) 

msg-genM 

:  X® 

x  W® 

x  I  — ■>  A: 

i 

(c) 

msg-recW 

:  X® 

x  W® 

X 

> 

X 

1 — i 

i 

w®; 
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(ii)  (stf-trigj^ ,  stfj^ ) ,  i  £  I,  k  £  {1, . . . ,  called  the  kth  state-transition 
trigger  function  and  the  kth  (processor)  state-transition  function, 
respectively,  such  that 

(a)  stf-trigj^  :  XM  x  — >  {true,  false}, 

(b)  stfJJ  :  XW  x  W®  -> 

(iii)  ctlW  :  xW  x  —>  U^\  i  £  I,  called  (motion)  control  functions. 

If  5  is  uniform  and  if  =  W,  (msg-trig^ ,  msg-gen  M ,  msg-rec  ^ )  =  (msg-trig,  msg-gen,  msg-rec), 
Kstf  =  Ksth  (stf-trig^ ,  stfj^ )  =  (stf-trigfe,stffc),  for  k  £  {1, . . . ,  Xstf},  and 
ctlM  =  ctl,  for  all  i  £  I,  then  £CC  is  said  to  be  uniform  and  is  described  by 
the  tuple 

(A,  W,  {W^]}ia,  (msg-trig,  msg-gen,  msg-rec),  {stf-trigfc,  stffc}^*{,  ctl).  • 


Observe  that  a  key  difference  between  Definitions  3.9  and  6.1  is  that  the 
message-generation  and  state-transition  functions  are  substituted  by  sets  of 
maps  (msg-trig W,  msg-gen W,  msg-rec W)  and  (stf-trigj^,  stf^).  A  second  dif¬ 
ference  is  that  the  control  function  depends  only  upon  current  robot  position 
and  not  position  at  last  sample  time.  In  each  triplet  (msg-trigM ,  msg-gen W ,  msg-rec M ) , 
the  map  msg-trigM  acts  as  a  trigger  for  agent  i  to  send  a  message  to  its  neigh¬ 
bors,  the  map  msg-genW  computes  the  message  to  be  sent,  and  msg-recM 
specifies  how  agent  i  updates  its  processor  state  when  receiving  a  message. 

This  models  situations  in  which  the  agent  physical  and  processor  states 
need  to  satisfy  certain  constraints  before  a  message  should  be  sent.  In  hy¬ 
brid  systems  terminology  [van  der  Schaft  and  Schumacher,  2000],  the  map 
msg-trig^  can  be  seen  as  a  guard  map.  Similarly,  in  the  pair  (stf-trigj^ ,  stf^ ) , 

M 

the  map  stf-trig),  acts  as  a  trigger  for  agent  i  to  update  its  processor  state. 

If  several  stf-trig),  are  satisfied  at  the  same  time,  then  the  agent  can  freely 
choose  among  the  corresponding  state  transition  functions  to  update  the 
processor  state.  This  freedom  means  that  our  dynamical  system  is  described 
by  a  set-valued  map  and  leads  to  non-deterministic  evolutions. 

The  evolution  of  a  robotic  network  dictated  by  an  event-driven  control  and 
communication  law  is  asynchronous:  there  is  not  a  common  time  schedule 
for  all  robots  to  send  messages,  receive  messages,  and  update  their  processor 
states.  Only  when  an  event  happens,  an  agent  sends  a  message  or  updates 
its  state.  The  asynchronous  event-driven  evolution  can  be  loosely  described 
as  follows. 

(i)  Starting  from  the  initial  conditions,  the  physical  state  of  each  agent 
evolves  in  continuous  time  according  to  the  control  function  ctl^ ; 
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(ii)  at  every  instant  of  time  t\  £  M>o  such  that  the  message-trigger  func¬ 
tion  for  agent  i  satisfies  msg-trigW(xW(ti),u;W(ti))  =  true,  agent  i 
generates  a  non-null  message  according  to  msg-genW  and  sends  it 
to  all  its  out-neighbors.  At  time  t\,  each  out-neighbor  j  of  agent  i 
receives  a  messages  and  processes  it  according  to  msg-rec^l .  If  mul¬ 
tiple  messages  are  receives  at  the  same  time,  we  allow  all  possible 
orders  of  execution  of  the  message-reception  function; 

(iii)  additionally,  at  every  instant  of  time  t2  £  M>o  such  that  one  of  the 
state-transition  triggers  satisfies  stf-trigj^(xM  fa),  (£2),  2/^ (£2))  = 

true,  agent  i  updates  its  processor  state  uM  according  to  stfj^.  If 
multiple  state  transitions  are  triggered  at  the  same  time,  we  allow 
all  possible  orders  of  execution  of  the  state-transition  functions; 

(iv)  if  one  or  multiple  state-transition  and  message  triggers  are  equal  to 
true  at  the  same  time,  then  we  assume  that  all  state  transitions 
take  place  first,  and  immediately  after  the  messages  are  generated 
and  transmitted; 

(v)  if  one  or  multiple  state-transition  triggers  are  equal  to  true  at  the 
same  time  at  which  messages  are  received,  then  we  allow  all  possible 
orders  of  execution  of  the  state-transition  and  message-reception 
functions;  and 

(vi)  in  order  to  avoid  the  possibility  of  an  infinite  number  of  message 
transmissions  or  state  transitions  in  finite  time,  we  introduce  a 
“dwell  logic.”  Let  6  >  0  be  a  dwell  time  common  to  all  agents. 
For  each  agent  i,  if  a  message  was  generated  at  time  t\  by  agent  i, 
then  no  additional  message  is  to  be  generated  before  time  t±  +  e  by 
agent  i  independently  of  the  value  of  its  message-trigger  function. 
Similarly,  for  each  agent  i,  if  a  state-transition  function  was  executed 
at  time  t2  by  agent  i,  then  no  additional  state-transition  functions 
is  to  be  executed  before  time  t2  +  s  by  agent  i  independently  of  the 
value  of  its  state-transition-trigger  function. 

Remark  6.2  (Dwell  time  prevents  Zeno  behavior).  We  remark  that 
(1)  a  dwell  time  is  introduced  here  only  for  the  purpose  of  properly  defining 
an  execution  for  a  general  event-drive  control  and  communication  law;  (2) 
infinite  numbers  of  message  transmissions  or  state  transitions  in  finite  time 
will  not  take  place  during  the  execution  of  the  algorithm  we  present  later  in 
the  chapter;  and  (3)  further  details  on  so-called  Zeno  behavior  are  discussed 
by  van  der  Schaft  and  Schumacher  [2000],  Johansson  et  al.  [1999].  • 

Finally,  as  we  did  in  Chapter  3,  we  now  give  a  formal  definition  of  the 
asynchronous  evolution  of  an  event-driven  control  and  communication  law 
on  a  robotic  network. 
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Definition  6.3  (Asynchronous  event-driven  evolution  with  dwell 
time).  Let  £CC  be  an  event-driven  control  and  communication  law  for  the 
robotic  network  S.  For  5  £  M>o,  the  evolution  of  ( S,£CC )  with  dwell  time 
6  from  initial  conditions  xl0  £  Xq  and  wl0  £  Wq  ,  i  £  /,  is  the  collection  of 
absolutely  continuous  curves  x'W  :  M>o  — >  X^l\  i  £  /,  and  piecewise-constant 
curves  ;  M>0  — >  .  i  g  /,  such  that  at  almost  all  times 

i^(t)  =  /(x^(i),  ctl^(xW(t),u;W(t))),  (6.1.1) 

wW(f)  =  0,  (6.1.2) 


with  xW(0)  =  x$  and  u>M(0)  =  Wq\  i  £  /,  and  such  that 

(i)  for  every  i  £  /  and  fi  £  M>o,  a  message  is  generated  by  agent  i  and 
received  by  all  its  out- neighbors  j,  that  is, 

V[i\t  i)  =  msg-genW(xW(ti),u;W(ti),j), 

w^(ti)  =  msg-rec^  (x^(ti),  lim  (t) , (t\) ,  i) , 

if  msg-trigM(xM(fi)  ,  rc^(ii))  =  true  and  agent  i  has  not  trans¬ 
mitted  any  message  to  its  out-neighbors  during  the  time  interval 
]t\  —  5,  fi[nM>o.  Here,  agent  j  is  an  out-neighbor  of  agent  i  at  time 
1 1  if  (i  ,  j  )  £  F'cmm  (xM(fi), . . .  ,x[n](ti));  and 

(ii)  for  every  i  €  I,  k  €  {1, ,  A'^}  and  t2  £  M>o  the  state-transition 
function  stf).  is  executed,  that  is, 

w^{t2)  =  stf^  (xW(t2),  lim  u;W(t)), 

t  >t2 

if  stf-trigj^  (xW  (t2),w^  ](^2))  =  true  and  there  has  been  no  execution 
of  stfj*]  during  the  time  interval  ]t2  —  S,  t2[nM>o-  • 


6.2  PROBLEM  STATEMENT 

In  this  section,  we  formalize  the  network  objective.  We  begin  by  reviewing 
some  important  facts  on  interpolations  of  planar  boundaries  by  means  of  in¬ 
scribed  polygons.  After  introducing  the  robotic  network  model,  we  build  on 
the  notions  from  linear  interpolations  to  formalize  the  boundary  estimation 
task. 
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6.2.1  Linear  interpolations  for  boundary  estimation 

Consider  a  simply  connected  set  Q  in  R2  that  we  term  body.  We  are  inter¬ 
ested  in  obtaining  a  description  of  the  boundary  dQ  of  a  convex  body.  In 
particular,  we  will  consider  the  symmetric  difference  error  metric  [Gruber, 
1983]  to  measure  the  goodness  of  an  approximation  to  dQ.  The  symmetric 
difference  5s  between  two  compact  bodies  C,  B  CM2  is  defined  by 

5S(C,  B)  =  n{C  U  B)-  [i(C  n  B), 

where  p,  is  the  Lebesgue  measure  on  M2.  This  definition  is  illustrated  in 
Figure  6.1.  We  note  that  the  symmetric  difference  admits  alternative  defini¬ 
tions,  see  Exercise  E6.1.  In  what  follows,  we  will  search  for  approximations 


Figure  6.1  The  symmetric  difference  between  the  two  quadrilaterals  is  the  area  corre¬ 
sponding  to  the  region  colored  in  light  gray. 

to  a  convex  body  Q  by  means  of  inscribed  polygons.  A  polygon  is  inscribed 
in  Q  if  all  its  vertices  belong  to  the  boundary  of  Q.  We  denote  an  inscribed 
polygon  with  m  vertices  by  Qm.  The  symmetric  difference  between  the  body 
Q  and  the  polygon  Qm  takes  the  simpler  form 

dS(Q,  Qm)  =  p(Q)  -  p(Qm)- 


The  inscribed  polygons  that  are  critical  points  of  5s  can  be  characterized 
as  follows.  For  convenience,  we  denote  by  t  :  R2  — >  R3  the  inclusion  map 
defined  by  i(q)  =  (q,  0). 

Lemma  6.4  (Characterization  of  critical  inscribed  polygons  for  the 
symmetric  difference).  Let  Q  be  a  convex  body  and  let  7  :  [0,  27t]  — »  R2  be 
a  continuously  differentiable  parameterization  of  dQ.  Let  Qm  be  an  inscribed 
polygon  with  vertices  {q\ , . . . ,  qm}  in  counterclockwise  order.  Let  Sj  G  [0,  27r] 
be  such  that  qj  =  7 (sj),  for  j  G  {1, . . . ,  m}.  Then,  Qm  is  a  critical  point  of 
5s  if  and  only  if 

LW{sj))  x  I'faj+i  -  qj-i)  =  O3,  /ora/lj6{l,...,m},  (6.2.1) 
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where  q0  =  qm  and,  qm+ i  =  q\ . 

Note  that  the  characterization  of  critical  inscribed  polygons  in  Lemma  6.4 
can  be  satisfied  not  only  by  polygons  that  are  maximizers  but  also  by  sad¬ 
dle  points,  see  Exercise  E6.2.  On  the  other  hand,  we  would  like  to  make 
use  of  a  characterization  that  can  be  extended  to  nonconvex  bodies  and 
that  relies  as  much  as  possible  on  local  information  that  agents  can  col¬ 
lect.  In  what  follows,  we  describe  the  method  of  empirical  distributions, 
based  on  the  asymptotic  formula  of  Lemma  6.5.  This  method  was  pro¬ 
posed  by  McLure  and  Vitale  [1975]  to  obtain  good  polygon  approximations 
to  convex  bodies,  and  admits  an  extension  to  nonconvex  bodies  as  we  discuss 
next. 

Let  7arc (q)  be  the  arc- length  parametrization  of  dQ  and  6  be  the  angular 
position  in  a  polar  variable  parameterization,  7poiar($),  of  dQ.  Assume 
that  both  parametrizations  transverse  dQ  in  a  counterclockwise  manner. 
Let  p  and  Kabs  be  respectively  the  curvature  radius  and  curvature  of  the 
boundary  (cf.  Section  1.1).  For  convex  bodies,  the  following  result  is  proved 
in  [McLure  and  Vitale,  1975,  Gruber,  1983]. 

Lemma  6.5  (Optimal  polygonal  approximation  of  a  convex  body). 

Let  Q  be  a  convex  body  in  R2  whose  boundary  is  of  class  C2  with  strictly 


positive  curvature  Kabs-  Let  Q*m  denote  an  optimal  approximating  polygon 
ofQ.  Then 


To  compute  an  optimal  approximating  polygon  for  a  strictly  convex  body,  McLure  and  Vitale 
[1975]  suggest  the  following  method  of  empirical  distributions.  Let  qi, . . .  ,qm 
be  consecutive  points  on  dQ  ordered  counterclockwise,  such  that  qt  =  7arc (6i), 
and  qi  =  7 (Qi),  for  i  E  { 1, . . .  ,m }.  According  to  the  method  of  empirical 
distributions,  the  positions  qi,  i  E  {1, . . . ,  m},  of  the  vertices  along  dQ  have 
the  property  that 


r0i+ 1  , 

/  p(0)2^d0 

Jdi 


(6.2.2) 


has  the  same  value  for  every  consecutive  pair  of  indices  (i,i  +  1),  with 
i  E  {1, . . .  ,m}.  Interpolating  polygons  computed  according  to  this  method 
will  converge  to  an  optimal  polygon  approximation  Q*m  as  m  tends  to  in¬ 
finity.  Roughly  speaking,  this  property  translates  into  the  placement  of 
more  interpolation  points  on  those  parts  of  the  boundary  that  have  higher 
curvature.  Figure  6.2  illustrates  an  optimal  approximating  polygon. 
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Figure  6.2  Equidistant  interpolation  points  according  to  the  integral  in  equation  (6.2.2). 

The  solid  line  represents  the  boundary  and  the  dashed  line  represents  the 
optimal  approximating  polygon. 

Let  us  finish  this  section  by  adapting  the  above  discussion  to  the  setup 
of  nonconvex  bodies.  Given  a  curve  7  :  [0, 1]  — >  M2,  we  define  an  inflection 
point  to  be  7(s0)  such  that  sign(Atsigned(s0  -  e))  /  sign(Atsigned(.so  +  e))  for 
every  e  E  M>o  sufficiently  small.  For  nonconvex  bodies  with  a  finite  number 
of  inflection  points  whose  boundary  can  be  parameterized  by  a  continuously 
differentiable  curve,  the  method  of  empirical  distributions  also  yields  nearly 
optimal  distributions  as  m  tends  to  infinity  because  of  the  local  convexity  of 
the  body  away  from  inflection  points.  However,  since  the  curvature  radius 
may  be  unbounded  at  some  points  of  a  nonconvex  boundary,  it  may  exist 
i  such  that  the  integral  (6.2.2)  is  unbounded.  We  avoid  this  problem  by 
considering  the  following  general  notion  of  distance  along  a  boundary.  For 
any  two  points  %  =  7 arc(ft)  and  qj  =  7arc (Qj),  with  <  Qj,  define 

L^curvatur  c(ft;ft)  —  f  ^abs  (7arc  (q)  )  ^  fob 
J  Si 

£{qi,Qj)  =  Qj  -  Qi- 

Note  that  the  above  quantities  are  positive  only  when  dQ  is  transversed 
counterclockwise  going  from  qt  to  qy  In  the  sequel,  we  will  be  implictly 
assuming  this  is  the  case.  On  the  other  hand,  note  that  p(9)ad6  = 
Jqq  Kabs(7arc)1_“d7arc  for  a  >  0,  see  [  Iruber,  1983].  Thus,  the  first  formula 
is  equivalent  to  (6.2.2).  For  A  E  [0, 1],  define  now  the  pseudo-distance  V\ 
between  the  vertices  qi  and  qj  as 

j  Qi+l )  —  ^  ^curvature  (ft;  9j)  "F  (1  /^)-^-'(ft;  ft+l)- 

The  criterion  of  the  method  of  the  empirical  distributions  will  be  substi¬ 
tuted  by  the  following  one  when  the  boundary  is  nonconvex.  We  will  look  for 
approximations  of  dQ,  {gi, . . . ,  qn},  such  that  T>\(qi,  ft+i)  have  all  the  same 
value  for  every  i  E  {1, . . .  ,n}.  This  choice  has  the  following  interpretation. 
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Taking  A  ~  1  leads  to  an  interpolation  that  almost  satisfies  the  criterion  of 
the  method  of  empirical  distributions.  However,  for  those  portions  of  the 
boundary  where  ftabs(7arc)  is  zero,  the  method  is  ill  defined.  On  the  other 
hand,  taking  AwO  leads  to  an  interpolation  that  divides  the  boundary  in 
equal  arclength  segments.  A  choice  of  A  G  (0, 1)  will  lead  to  a  polygon 
approximation  that  is  midway  between  these  two  options.  For  A  >  1/2, 
the  polygon  will  still  have  a  higher  number  of  vertices  in  the  portions  of 
the  boundary  with  higher  curvature,  and  we  guarantee  that  the  distance 
between  two  different  interpolation  points  is  always  positive. 


6.2.2  Network  model  and  boundary  estimation  task 


Next,  we  formulate  a  robotic  network  model  and  the  boundary  estimation 
objective.  Assume  that  Q  is  a  simply  connected  subset  of  R2  with  differ¬ 
entiable  boundary  dQ.  Consider  the  network  <Sbndry  =  (/,  77.,  £cmm),  with 
I  =  {1, ...  ,n}.  In  this  network,  each  robot  is  described  by  a  tuple 

(dQ,  [-1W,  umax],  dQ,  (0,  e)),  (6.2.3) 

where  e  is  the  vector  field  tangent  to  dQ  describing  counterclockwise  motion 
at  unit  speed;  we  assume  that  unit  speed  is  an  admissible  speed,  that  is, 
we  assume  1  £  [— umin,  umax].  We  assume  each  robot  can  sense  its  own 
location  £  dQ,  i  £  I,  and  can  communicate  with  its  clockwise  and 
counterclockwise  neighbors  along  dQ.  In  other  words,  the  communication 
graph  Ecmm  is  the  ring  graph  or  the  Delaunay  graph  on  dQ.  Later  we  shall 
assume  that  Q  varies  in  a  continuously  differentiable  way  with  time,  and 
that  therefore,  agents  move  along  its  time-varying  boundary. 


Next,  assume  that  the  processor  state  of  each  agent  contains  a  set  of  mp 
interpolation  points  used  to  approximate  dQ,  that  is,  the  processor  state  is 
given  by  £  (R2)n‘p,  for  i  £  I.  We  illustrate  the  combination  of  agents 
and  interpolation  points  along  the  boundary  in  Figure  6.3. 


For  e  £  R>o  and  A  £  [0, 1],  the  boundary  estimation  task  Te_ bndry  :  ( dQ)nx 
((M2)?lip)ri  — >  {true,  false}  for  5bndry  is  the  coordination  task 


^-bndry(p[11,  •  ■  •  , ,  9[1] ,  •  •  •  ,«N)  =  true 


V\  (il  l ,  Q[a] )  -  V\  ( Q[a] >  Qa+1 )  ) 


<  e, 


if  and  only  if 

a  £  {1, ... ,  nip}  and  i  £  I. 


Roughly  speaking,  this  task  is  achieved  when  the  7iip  interpolation  points 
are  approximately  uniformly  placed  along  the  boundary  according  to  the 
counterclockwise  pseudo-distance  T>\. 


A  second  objective  is  our  desire  to  space  the  agents  equally  far  apart 
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Figure  6.3  Agents  and  interpolation  points  on  the  boundary  dQ. 


along  the  boundary.  As  in  Example  3.22,  for  e  >  0,  we  define  the  agent 
equidistance  task  TLeqdstnc  :  ( dQ)n  — >  {true,  false}  to  be  true  if  and  only 
if 

p1^)  —  <  e,  for  all  i  £  I, 

where  C  is  the  counterclockwise  arc- length  distance  along  dQ.  In  other 
words,  ^-eqdstnc  is  true  when,  for  every  agent,  the  (unsigned)  distance  to 
the  closest  clockwise  neighbor  and  to  the  closest  counterclockwise  neighbor 
are  approximately  equal. 

6.3  ESTIMATE  UPDATE  AND  CYCLIC  BALANCING  LAW 

Here  we  propose  a  coordination  algorithm  for  a  robotic  network  to  achieve 
the  boundary  estimation  task.  The  algorithm  makes  individual  agents  pro¬ 
duce  and  continuously  update  an  approximation  of  the  boundary  that  asymp¬ 
totically  meets  the  criterion  of  the  method  of  empirical  distributions.  The 
algorithm  is  an  event-driven  control  and  communication  law,  as  defined  in 
Section  6.1.  To  facilitate  the  understanding,  the  algorithm  is  presented  in  an 
incremental  way.  First,  we  specify  an  estimate  update  law  for  a  single  robot. 
Second,  we  consider  multiple  robots  cooperatively  performing  the  estimate 
update  law  to  achieve  the  boundary  estimation  task.  Third  and  finally,  we 
introduce  a  cyclic  balancing  algorithm  to  achieve  a  robot  equidistance  task. 


6.3.1  Single-robot  estimate  update  law 

Let  Q  be  a  simply  connected  subset  of  R2  with  a  differentiable  moving 
boundary  dQ.  Consider  a  single  robot  described  by  (6.2.3)  that  moves 
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along  dQ.  Assume  the  processor  state  contains  a  set  of  interpolation  points 
{q\ , . . . ,  qn ip}  used  to  approximate  dQ.  We  begin  with  an  informal  descrip¬ 
tion  of  the  Single-Robot  Estimate  Update  Law  and  we  illustrate  in 
Figure  6.4  the  two  actions  characterizing  this  law. 


[Informal  description]  The  agent  moves  counterclockwise  along 
the  moving  boundary  dQ  collecting  estimates  of  its  tangent  and 
curvature.  Using  these  estimates,  the  agent  executes  the  fol¬ 
lowing  two  actions:  First,  it  updates  the  positions  of  the  in¬ 
terpolation  points  so  that  they  take  value  on  the  estimate  of 
dQ.  In  other  words,  as  sufficient  information  is  available,  each 
interpolation  point  qa,  a  E  {l,...,nip},  is  projected  onto  the 
estimated  boundary.  Second,  after  an  interpolation  point  qa  has 
been  projected,  the  agent  collects  sufficient  information  so  that 
it  can  locally  optimize  the  location  of  qa  along  the  estimate  of 
dQ.  Here  by  an  estimate  of  the  time- varying  dQ,  we  mean  the 
trajectory  of  the  agent  along  the  moving  boundary. 


Figure  6.4  The  two  actions  characterizing  the  Single-Robot  Estimate  Update  Law. 

Next,  we  begin  our  detailed  description  of  the  algorithm  by  specifying 
what  variables  the  agent  maintains  in  its  memory.  The  processor  state  of 
the  agent  consists  of  the  following  variables: 

(i)  a  counter  nxt  taking  values  in  {1, . . .  ,n-ip}  that  specifies  which  in¬ 
terpolation  point  the  agent  is  going  to  project  next; 

(ii)  a  boundary  representation  comprised  of  the  pairs 

{{qa,va)  G  (M2)2  |  a  G  {1, ... ,  nip}}, 

where  qa  is  the  position  of  the  interpolation  point  a  and  va  repre¬ 
sents  the  tangent  vector  of  dQ  at  qa ; 

(iii)  a  curve  of  the  form  path  :  [0 ,t]  — >  R2.  This  curve  is  the  trajectory 
followed  by  the  agent  from  initial  time  until  present  time  t.  We 
assume  that  the  agent  updates  the  variable  path  continuously.  We 
let  C(M2)  be  the  set  of  planar  curves,  that  is,  twice-differentiable 
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functions  from  an  interval  to  M2.  With  this  notation  we  may  write 
path  E  C(M2). 


Figure  6.5  The  agent  moves  along  the  time- varying  boundary  dQ,  here  depicted  as  a 
sequence  of  growing  ellipses,  and  its  trajectory  is  an  approximation  to  dQ. 

Remark  6.6  (Boundary  approximation).  For  simplicity,  we  assume 
that,  at  every  instant  of  time,  the  agent  is  located  exactly  on  top  of  the 
boundary.  This  assumption  implies  that,  if  the  boundary  is  time-invariant, 
then  the  agent  trajectory  path  is  locally  equal  dQ.  Furthermore,  if  the 
boundary  is  slowly  time- varying,  then  the  agent  trajectory  path  is  an  esti¬ 
mate  of  the  moving  boundary  dQ ,  as  illustrated  in  Figure  6.5.  • 

The  agent  updates  its  processor  state  according  to  the  following  two  rules: 

Rule  #1:  When  and  how  to  project  onto  dQ  the  interpolation  point  qnx t. 
Let  qnxt  denote  the  interpolation  point  about  to  be  projected  and  unxt  denote 
the  corresponding  tangent  vector.  The  projection  takes  place  when  the 
agent  crosses  the  line,  denoted  by  linenxt,  that  passes  through  qnxt  and  is 
perpendicular  to  rnxt.  At  this  crossing  time,  we  define  the  updated  values  for 
the  interpolation  point  nxt,  denoted  by  q+x t,  to  be  the  point  on  path  where 
the  agent  trajectory  crosses  the  line  linenxt.  This  projection  operation  is 
illustrated  in  Figure  6.6.  We  refer  to  this  operations  by  the  map  perp-proj  : 


Figure  6.6  Projection  of  interpolation  point  gnxt  onto  the  curve  path. 

(M2)2  x  C(R2)  ->  M2  ;  in  other  words  we  write 

Qnxt  :  =  perp-proj  (qnxt,vnxt,  path). 
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Rule  # 2 :  When  and  how  to  optimize  the  interpolation  point  (/nxt-i  ■  The 
local  optimization  of  the  interpolation  point  (nxt  —  1)  takes  place  immedi¬ 
ately  after  the  projection  of  the  interpolation  point  nxt  onto  the  estimated 
boundary.  The  interpolation  point  (nxt  —  1)  is  moved  along  the  curve  path 
in  order  to  balance  its  two  pseudodistances  to  its  clockwise  and  counter¬ 
clockwise  neighboring  interpolation  points  (recall  that  path  is  an  estimate 
of  the  boundary  dQ  as  discussed  in  Remark  6.6).  Specifically,  we  define  the 
map  cyclic-balance  :  (R2)3  X  C(R2)  — >  R2  by 

cyclic-balance(gnxt_2,  (foxt-i,  Qnxt,  path)  is  point  q*  in  the  curve  path 

3  1 

such  that  TR((/nxt— 2j  Q  )  =  ~^R^\{Qnxt— 2j  Qnxt—  l)  T  1>  Qnxt)  • 

This  map  is  illustrated  in  Figure  6.7.  With  this  definition,  we  update  the 
interpolation  (nxt  —  1)  to  be 

<?nxt-i  :=  cyclic-balance(gnxt_2,gnxt-i,<?nxt,path). 


Figure  6.7  Optimal  placement  of  the  interpolation  point  qnx t-i  along  the  curve  path. 


Remark  6.7  (Balancing  property  of  the  optimal  placement).  The 

optimal  placement  q+xt_i  can  be  equivalently  defined  by 

1  3 

^^(llnxt  — 1>  f/nxt)  =  ^TR(<?nxt-2,  Qnxt-l)  +  (f/nxt  —  1  j  Qnxt )  > 

so  that  the  it  achieves  the  balancing  property  that 


T,A(Q,nxt-2)  g+xt-l) 

1 

'3 

1' 

'D\(Qnxt—2->  Qnxt  —  l) 

^■^(^nxt—li  9nxt) 

“  4 

1 

3 

'D\(Qnxt  —  li  Qnxt)  • 

This  iteration  is  the  same  as  the  cyclic  balancing  system  with  parameter 
k  =  1/4  studied  in  Exercises  El. 29,  E5.4,  and  E6.3.  • 

Finally,  we  define  one  last  useful  operation.  Given  a  point  on  the  curve 
path  it  is  useful  to  be  able  to  compute  the  tangent  of  the  curve  path  at  the 
point  q.  Specifically,  given  a  point  q  in  the  curve  path,  we  shall  write 

v  :=  tangent  at  (path,  q). 
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In  summary,  the  Single-Robot  Estimate  Update  Law  is  formally  de¬ 
scribed  as  follows. 


Robot:  single  robot  moving  at  constant  speed  along  dQ, 
continuously  recording  its  trajectory 

Event-driven  Algorithm:  Single- Robot  Estimate  Update  Law 
Processor  State:  w  =  (nxt,  {((fa,  va)y^T^  path),  where 

nxt  E  {1, . . . ,  n ip},  initially  equal  to  index  of  the  interpo¬ 

lation  point  closest  to  robot  moving 
counterclockwise 

{{(lai* (i) (ii) * * va)y^=i  C  l2  x  I2,  initially  counterclockwise  along  boundary 
path  E  C(M2),  continuously  recording  agent’s  trajectory 

%  A  state  transition  is  triggered  when  the  agent  crosses  a  certain  line 
function  stf-trig(p,  w) 

i:  linenxt  :=  line  through  point  qnxt  perpendicular  to  direction  unxt 
2:  if  p  E  linenxt  then 
3:  return  true 

4:  else 

5:  return  false 

%  The  current  interpolation  point  and  tangent  vector  are  projected  and  the 
previous  interpolation  point  is  optimized  along  the  new  boundary 
function  stf (p,w) 

1:  -={{Qa,Va)}a= 1 

2:  Qnxt  :=  perp-proj  (qnxt,  wnxt,  path) 

3:  dnxt-i  :=  cyclic-balance(qnxt_2,  <?nxt-i)  Qnxti  path) 

4:  vnxt  :=  tangentat(path ,  q+t) 

5:  vnxt-i  :=  tangentat(path,g+t_i) 

6:  return  (nxt  +  1,  {(<?+,  1;+)}^,  path) 


The  Single- Robot  Estimate  Update  Law  may  be  improved  in  a  num¬ 
ber  of  ways;  here  we  present  some  important  algorithmic  clarifications. 


Remarks  6.8  (Content  and  representation  of  the  variable  path). 

(i)  The  above  discussion  assumes  that,  with  the  information  provided 
by  the  variable  path,  the  agent  can  compute  the  tangent  vector  and 
the  curvature  along  its  trajectory  in  order  to  perform  the  calculation 
of  the  pseudodistance  V\. 

(ii)  It  is  not  necessary  for  the  agent  to  keep  in  the  variable  path  its 

entire  trajectory  since  initial  time.  In  fact,  when  the  agent  updates 
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the  location  of  the  interpolation  point  (nxt  —  1)  in  instruction  4:  of 
the  state-transition  function,  it  is  sufficient  that  path  contains  the 
trajectory  of  the  agent  starting  from  interpolation  point  (nxt  —  2) 
until  the  current  agent  position.  This  “limited-length”  requirement 
may  be  implemented  as  follows:  the  variable  path  is  a  curve  of  the 
form  path  :  [tpath,  t]  — >  M2,  the  variable  t  denotes  the  present  time, 
the  variable  tpath  is  initially  set  equal  to  0,  and  the  instruction 

tpath  :=  t*  such  that  qnxt-i  =  path (f*), 
is  executed  before  instruction  6:  in  the  state-transition  function. 

(iii)  In  a  realistic  implementation,  the  path  variable  and  its  first  two 
derivatives  are  to  be  represented  with  finite  resolution  over  a  discrete 
time  domain.  The  interested  reader  is  referred  to  [Susca  et  al.,  2008] 
for  a  discussion  of  the  Single-Robot  Estimate  Update  Law 
algorithm  with  a  realistic  implementation  of  the  path  variable  in  a 
finite-length  finite-resolution  manner.  • 

Remark  6.9  (Timeout  for  the  projection  of  interpolation  points). 

In  the  definition  of  Single-Robot  Estimate  Update  Law,  we  have  im¬ 
plicitly  assumed  that  the  agent  crosses  the  line  through  qnxt  perpendicular 
to  vnxt  •  This  is  certainly  the  case  if  dQ  is  static  or  slowly  time- varying.  How¬ 
ever,  if  dQ  changes  drastically,  it  is  conceivable  that  the  agent  never  crosses 
the  line  through  qnxt  perpendicular  to  unxt .  In  other  words,  it  can  happen 
that  the  state-transition  trigger  function  is  always  false.  This  situation 
can  be  prevented  by  prescribing  a  timeout  such  that,  if  the  agent  has  not 
crossed  the  line  after  certain  time  has  elapsed,  then  the  interpolation  point 
nxt  is  projected  onto  its  trajectory  anyway.  Formally,  let  t*  be  implicitly 
defined  by 

-1>P(0)  =  2X>A(<7nxt  — l,<7nxt)- 

If  no  crossing  has  happened  at  time  t,  then  q^xt  is  set  equal  to  the  point 
on  path  that  is  closest  to  qnxt.  The  tangent  vector  is  set  equal  to 
tangentat(path,  q+x t).  This  projection  is  well-defined  and  has  the  property 
that  if  dQ  is  time- invariant,  then  q+xt  =  qnxt.  The  algorithm  described 
by  Susca  et  al.  [2008]  explicitly  incorporates  this  timeout.  • 


6.3.2  Cooperative  estimate  update  law 

In  the  previous  section  we  presented  an  event-driven  algorithm  for  a  single 
robot  to  monitor  a  boundary.  We  consider  the  robotic  network  5t>ndry  wifi1 
ring  communication  topology,  described  in  Section  6.2.2,  and  we  develop  a 
parallel  version  of  the  Single- Robot  Estimate  Update  Law  that  allows 
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the  network  to  monitor  the  boundary  efficiently  (i.e. ,  with  more  accuracy 
than  a  single  robot  could).  We  begin  with  an  informal  description  of  the 
Cooperative  Estimate  Update  Law. 

[Informal  description]  Each  agent  moves  counterclockwise  along 
the  moving  boundary  dQ,  has  its  individual  copy  of  the  bound¬ 
ary  representation,  including  the  interpolation  points,  and  exe¬ 
cutes  the  Single- Robot  Estimate  Update  Law.  Because  the 
agents  are  spatially  distributed,  each  agent  updates  its  individ¬ 
ual  boundary  representation  separately.  On  top  of  the  SlNGLE- 
Robot  Estimate  Update  Law,  the  agents  run  a  communi¬ 
cation  protocol  that  transmits  the  updated  interpolation  points 
along  the  ring  topology.  Specifically,  every  time  an  agent  updates 
two  interpolation  points  (using  the  perp-pro  j  and  cyclic-balance 
functions  in  the  state-transition  function  of  Single- Robot  Es¬ 
timate  Update  Law),  this  agent  transmits  these  updated  in¬ 
terpolation  points  to  its  clockwise  and  counterclockwise  neigh¬ 
bors.  In  turn,  the  neighbors  record  the  updates  in  their  individ¬ 
ual  boundary  representation. 


Next,  we  give  a  more  detailed  description  of  the  algorithm.  We  assume 
each  robot  i  has  a  processor  state  with  its  local  copy  of  w W  containing  a 
counter  nxtM,  a  boundary  representation  {((/a  ,  )}a=i,  where  n;p  is  equal 

for  all  robots,  and  its  pathM.  The  Cooperative  Estimate  Update  Law 
is  formally  described  as  follows. 

Robotic  Network :  5bndry>  first-order  agents  moving  at  unit  speed  along  dQ 
with  absolute  sensing  of  own  position,  communicating 
with  clockwise  and  counterclockwise  neighbors 

Event-driven  Algorithm:  COOPERATIVE  ESTIMATE  UPDATE  Law 

Alphabet:  A  =  {1,  •  •  • ,  Rip}  X  (R2)2  x  (R2)2  Ujnull} 

Processor  State,  function  stf-trig,  and  function  stf 

same  as  in  Single- Robot  Estimate  Update  Law 

%  A  transmission  is  triggered  right  after  the  interpolation  points  are  updated 
function  msg-trig(p,  w) 
i:  return  stf-trig(p,  w) 

%  The  updated  interpolation  points  ( and  reference  label)  are  transmitted 
function  msg-gen(p,  w,  i) 

i:  return  (nxt,  (qnx t-i,  unxt-i),  ( <?nxt-2 ,  Wxt-2)) 

%  The  received  updated  interpolation  points  are  stored 
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function  msg-rec(p,  w,  y,  i ) 

1=  {(<£,*£)}£= 1  =  ={(?<*,  ««)}£=  1 
2:  (nxtrec,  yi,  y2)  :=  V 

3:  (^nxtrec— 1’  ^nxtrec— l)  :=  2/1 
(^nxtrec— 2>  Uixtrec— 2)  '  2/2 

5:  return  (nxt,  {(g+,  u+)}^1,  path) 


We  conclude  this  section  with  an  important  clarification.  We  begin  with 
a  useful  definition  and  then  give  two  related  remarks. 

Definition  6.10  (Two-hop  separation).  A  group  of  n  >  2  agents  imple¬ 
menting  the  Cooperative  Estimate  Update  Law  is  two-hop  separated, 
along  the  interpolation  points  if  nxtt1-1!  <  nxtW  —  2  for  all  i  6  I  at  all  times 
during  the  algorithm  execution.  • 


Remarks  6.11  (Well-posedness  of  the  Cooperative  Estimate  Up¬ 
date  Law). 

(i)  The  inequality  nxd®-1]  <  nxt  M  —  2  guarantees  that  each  robot  can 

correctly  implement  the  algorithm.  Indeed,  if  this  inequality  is  vi¬ 
olated,  then  the  cyclic-balance  function  performed  by  robot  i 
during  the  state-transition  function  is  invoked  with  an  interpola¬ 
tion  point  which  does  not  take  value  in  the  curve  pathM 

(because  the  boundary  might  be  time- varying) .  This  inequality 
therefore  guarantees  that  the  algorithm  is  well-posed.  Assuming 
two- hop  separation  guarantees  that  the  projection  and  optimiza¬ 
tion  events  happen  in  the  following  order:  each  interpolation  point 
nxt M  is  projected  and  later  optimized  by  robot  i,  strictly  before  it 
is  projected  by  robot  (i  —  1). 

(ii)  The  two- hop  separation  property  is  easily  seen  to  hold  when  (1)  the 

number  of  interpolation  points  nip  is  much  larger  than  the  num¬ 
ber  of  robots  n,  (2)  the  robots  are  approximately  equidistant  along 
dQ,  and  (3)  the  distances  between  pairs  of  consecutive  interpolation 
points  are  much  less  then  the  length  of  dQ  divided  by  n.  • 


6.3.3  Cyclic  balancing  algorithm  for  agent  equidistance  task 


Here  we  design  a  motion  coordination  controller  to  achieve  the  agent  equidis¬ 
tance  task  "Te-eqdstnc  among  the  agents  moving  along  the  boundary.  This 
task  also  leads  to  the  orderly  interactions  mentioned  in  the  last  remark. 
Specifically,  we  extend  the  Cooperative  Estimate  Update  Law  to  in- 
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elude  a  motion  coordination  component  that  makes  the  agents  achieve  the 
agent  equidistance  task  while  moving  at  approximately  unit  speed  along  the 
boundary.  The  control  design  is  straightforward:  for  robot  i  at  position  pH 
moving  in  continuous  time  with  speed  v  ^  along  dQ,  we  define 

=  1  +  kpmp(£(p[l\p[t+1])  -  £(p[*"1],pW)),  (6.3.1) 

where  &:prop  G  M>o  is  a  fixed  control  gain.  In  other  words,  the  agent  speeds 
up  or  slows  down  depending  upon  whether  it  is  closer  to  the  following 
or  to  the  preceding  agent,  respectively.  This  simple  motion  control  law 
is  the  continuous-time  analog  of  the  cyclic  balancing  system  described  in 
Exercise  El. 29;  recall  that  this  system  was  adopted  also  in  the  Single- 
Robot  Estimate  Update  Law  for  the  purpose  of  balancing  pseudodis¬ 
tances  among  interpolation  points. 


To  handle  the  lower  and  upper  bounds  constraints  on  the  velocity,  that 
is,  the  constraint  v  G  [— umin,  umax],  we  introduce  a  saturation  function  in 
the  design  (6.3.1).  Specifically,  we  implement 

=  sat[<w„,<w]  i1  +  kprop (C(p[l],p[l+1])  -  £(p[t-1],p M)))  ,  (6.3.2) 


where  the  saturation  function  sat 


sat[0)6](s)  =  < 


a,b]  ■  ^  - 

->  [a,  6] ,  for  a  <  b,  is  defined  by 

a, 

if  x  <  a, 

X, 

if  x  G  [a,  6], 

b, 

if  x  >  b. 

The  difficulty  in  implementing  controller  (6.3.2)  in  the  Cooperative 
Estimate  Update  Law  is  how  to  measure  the  counterclockwise  arc-length 
distance  between  robots.  To  tackle  this  difficulty,  let  us  begin  with  a  useful 
observation.  Given  the  interpolation  points  {^i, . . . ,  qUip}  and  two  points 
on  the  boundary  r  \ ,  r2,  assume  sufficient  information  is  available  to  com¬ 
pute  the  indices  [tt]  and  [r2]  of  the  counterclockwise-closest  interpolation 
points  from  n,  r2,  respectively.  With  these  notations  and  assumption,  the 
counterclockwise  pseudodistance  from  r\  to  r2  and  one  of  its  possible  ap¬ 
proximations  are 

[r2]— 2 

£(ri,r2)  =  £(ri,q[ri])  +  ^  C(qa,qQ+ i)  +  C(q[r2]-i,r2)  (6.3.3) 

a=[ri] 

[r2]-2 

«  dist2(ri,g[ri])  +  ^2  dist2(ga,ga+i) +  dist2(g[r2]_i,r2).  (6.3.4) 

a=[ri] 

Based  on  this  equality  and  on  this  approximation,  we  propose  two  methods 
to  implement  the  controller  (6.3.2).  One  may  implement  either 
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(i)  the  approximation  proposed  in  (6.3.4);  note  that  this  approximated 
pairwise  counterclockwise  arc-length  distance  may  be  computed  with 
the  information  available  to  the  agents  in  Cooperative  Estimate 
Update  Law;  or 

(ii)  the  exact  computation  proposed  in  (6.3.3);  in  order  to  perform  this 
computation  however  the  robots  require  more  information.  The  pro¬ 
cessor  state  needs  to  store  a  vector  of  arc- length  distances  C(qa,  qa+ 1), 
a  £  {1, . . . ,  nip}  that  are  measured  by  the  agents  as  they  move  and 
that  are  maintained  accurate  via  communication.  In  the  interest  of 
brevity  we  omit  a  detailed  discussion  of  this  point  here. 

Finally,  independently  of  the  computation  or  approximation  of  the  arc- 
length  distances,  the  implementation  of  controller  (6.3.2)  requires  each  agent 
to  have  a  continuous-time  estimate  of  the  location  of  its  clockwise  and 
counterclockwise  neighbors:  this  information  may  be  acquired  by  either  a 
dedicated  message-exchanging  protocol  or,  possibly,  by  proximity  sensors 
mounted  on  the  robots.  In  the  interest  of  brevity  we  omit  a  detailed  discus¬ 
sion  of  this  point  here. 


6.3.4  Correctness  of  the  estimate  update  and  cyclic  balancing  law 

We  call  Estimate  Update  and  Balancing  Law  the  combination  of  the 
Cooperative  Estimate  Update  Law  with  the  cyclic  balancing  control 
law  (6.3.2)  with  exact  arc-length  distance  computation  between  robots. 

We  call  Approximate  Estimate  and  Balancing  Law  the  combina¬ 
tion  of  the  Cooperative  Estimate  Update  Law  with  the  cyclic  balancing 
control  law  (6.3.2)  with  (1)  finite-resolution  finite-length  representation  of 
the  path  variable  in  the  robots,  (2)  approximate  arc-length  distance  com¬ 
putation  between  robots. 

We  state  the  properties  of  these  laws  in  the  following  theorem,  whose 
proof  is  postponed  to  Section  6.6. 

Theorem  6.12  (Correctness  of  the  exact  and  approximate  laws). 

On  the  network  5bndry>  along  evolutions  with  the  two-hop  separation  prop¬ 
erty, 

(i)  the  Estimate  Update  and  Balancing  Law  achieves  the  bound¬ 
ary  estimation  task  Te_\,n dry  and  the  agent  equidistance  task  Te_e qdstnc 
for  any  e  £  M>o  if  the  boundary  is  time-independent,  and 

(ii)  the  Approximate  Estimate  and  Balancing  Law  achieves  the 
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boundary  estimation  task  7^_bndry  and  the  agent  equidistance  task 
Ts_e qdstnc  for  some  e  G  M>o  if  the  boundary  varies  in  a  continuously 
differentiable  way  and  sufficiently  slowly  with  time,  and  its  length  is 
upper  bounded. 

Remark  6.13  (Error  induced  by  the  evolution  of  the  boundary  and 
its  discretization).  In  the  second  statement  in  the  theorem  the  constant 
e  depends  upon  the  rate  of  change  of  the  boundary  and  upon  the  accuracy 
of  the  various  approximations  taken  in  the  algorithm.  • 

6.4  SIMULATIONS  RESULTS 

In  order  to  illustrate  the  performance  of  the  algorithms,  we  include  here  dif¬ 
ferent  simulation  results  of  the  Approximate  Estimate  and  Balancing 
Law.  In  the  first  simulation,  the  boundary  to  be  estimated  is  time  invariant, 
while  in  the  second  is  time-varying. 


Time-invariant  boundary 


As  first  simulation,  we  assume  n  =  3  agents  aim  to  approximate  the  time- 
invariant  boundary  dQ  described  by  the  closed  curve 


7  (6)  =  (2  +  cos(5$)  +  O.5sin(20)) 


cos(0) 
sin(0)  ’ 


9  e  [0,2vr], 


The  control  gain  is  fcprop  =  0.05.  Minimum  and  maximum  velocities  are 
Urn  in  =0.5  and  umax  =  2.  The  number  of  interpolation  points  is  n\p  =  30. 
Pseudodistances  are  computed  with  A  =  -jj.  The  simulation  time  is  50 
seconds.  At  initial  time  the  interpolation  points  are  selected  to  be  the  posi¬ 
tions  of  the  agents  and  other  randomly  distributed  points  on  the  boundary. 
Finally,  each  robot  maintains  a  discretized  representation  of  its  trajectory 
path  with  a  resolution  of  0.01  seconds. 


The  behavior  of  the  Approximate  Estimate  and  Balancing  Law  is 
illustrated  in  Figure  6.8.  The  left  and  right  figures  correspond  to  the  po¬ 
sitions  of  the  interpolation  points  and  the  agents  at  the  initial  and  final 
configurations,  respectively.  In  the  right  figure  one  can  see  the  approximat¬ 
ing  polygon  and  how  close  it  is  to  the  actual  boundary. 


Figure  6.9  illustrates  the  convergence  of  the  algorithm.  Although  the  Ap¬ 
proximate  Estimate  and  Balancing  Law  uses  an  approximated  version 
of  the  pseudodistances  between  interpolation  points  and  of  the  arc-length 
distance  between  agents,  we  illustrate  the  performance  of  the  algorithm  by 
plotting  the  exact  versions  of  the  pseudodistances  and  arc-length  distances. 
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Figure  6.8  Approximate  Estimate  and  Balancing  Law  for  a  time-invariant  boundary: 

initial  and  final  configuration  of  agents  (drawn  as  triangles)  and  interpolation 
points.  The  right  figure  shows  also  the  approximating  polygon. 


Regarding  the  boundary  estimation  task,  the  left  figure  illustrates  how  the 
quantity  maxoe{1  ,...jn.p}  T>x(qa,  qa+i)  ~  min ae{i,...,nip}'D\(qa,  Qa+i)  does  in¬ 
deed  decrease  towards  zero,  even  though  it  does  not  vanish  because  of  the 
adopted  approximations.  Regarding  the  equidistance  task,  the  right  figure 
illustrates  how  the  agents  get  uniformly  spaced  along  the  boundary.  Again, 
the  arc-length  distances  converge  towards  a  common  steady-state  value,  even 
though  the  convergence  is  not  exact  because  of  the  adopted  approximations. 


Figure  6.9  Approximate  Estimate  and  Balancing  Law  for  a  time-invariant  boundary: 

the  left  hgure  shows  largest  minus  smallest  pseudodistance  between  neighbor¬ 
ing  interpolation  points.  The  right  figure  shows  the  three  arc-length  distances 
between  the  three  agents. 


Slowly  time-varying  boundary 


As  second  simulation,  we  assume  n  =  4  agents  aim  to  approximate  the 
time-varying  boundary  dQ  described  by  the  time-varying  closed  curve 


7 (6,  t)  =  (  2  Jlhlll - _  +  ^2  +  cos(5$)  +  0.5  sin(20)^ 


f  final 


)  *  ) 

cos (6) 

'  ^final ) 

sin(0) 
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with  9  £  [0,1],  t  £  [0,  tfinai] ,  1  final  =  200  seconds,  as  shown  in  Figure  6.10. 
The  parameters  and  initial  conditions  of  the  Approximate  Estimate  and 
Balancing  Law  are  the  same  as  in  the  time-invariant  case.  The  four  plots 
in  Figure  6.10  show  the  positions  of  the  interpolation  points  and  the  agents 
at  the  four  time  instants  0,  50,  100,  and  200  seconds,  respectively.  The  last 
plot  also  illustrates  how  close  the  approximating  polygon  is  to  the  actual 
boundary.  From  the  frames  in  Figure  6.10  it  is  clear  that  the  agents  can 
adapt  as  dQ  changes. 


Figure  6.10  Approximate  Estimate  and  Balancing  Law  for  a  time- varying  boundary: 

configuration  of  agents  (drawn  as  triangles)  and  interpolation  points  at  time 
instants  0,  50, 100,  200.  The  last  figure  shows  also  the  approximating  polygon. 


6.5  NOTES 

For  a  discussion  of  hybrid  systems  we  refer  to  [van  der  Schaft  and  Schumacher, 

2000].  Other  relevant  references  include  [Lygeros  et  ah,  2003,  Sanfelice  et  ah, 

2007,  Goebel  et  ah,  2004], 

There  are  many  methods  available  for  the  approximation  of  planar  curves  [Mehaute  et  ah, 
1993],  largely  motivated  by  computational  and  signal  processing  applica¬ 
tions.  Among  them,  the  use  of  interpolated  curves  is  a  standard  and  impor¬ 
tant  approach.  In  their  most  simple  version,  interpolations  provide  polygo¬ 
nal  approximations  of  curves,  with  generalizations  that  make  use  of  splines, 
or  combinations  of  functions  in  a  certain  basis.  In  particular,  the  problem 
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of  characterizing  the  polygons  that  optimally  approximate  a  closed,  convex 
body  is  a  classical  one;  see  the  survey  by  Gruber  [1983].  In  particular,  the 
asymptotic  formula  in  Lemma  6.5  was  extended  in  [Gruber,  1983]  to  higher 
dimensions  in  terms  of  the  Gauss  curvature. 

Boundary  estimation  and  tracking  is  useful  is  numerous  applications  such 
as  the  detection  of  harmful  algae  bloom  [Marthaler  and  Bertozzi,  2003,  Bertozzi  et  ah, 
2004],  oil  spill  [Clark  and  Fierro,  2007],  and  fire  containment  [Casbeer  et  ah, 

2005,  2006].  Marthaler  and  Bertozzi  [2003]  adopt  the  so-called  “snake  algo¬ 
rithm”  (from  the  computer  vision  literature)  to  detect  and  track  the  bound¬ 
ary  of  harmful  algae  bloom.  Each  agent  is  equipped  with  a  chemical  sensor 
that  is  able  to  measure  the  concentration  gradient  and  with  a  communica¬ 
tion  system  that  is  able  to  exchange  information  with  a  data  fusion  center. 

Bertozzi  et  al.  [2004]  suggest  an  algorithm  that  requires  only  a  concentration 
sensor:  the  agents  repeatedly  cross  the  region  boundary  using  a  bang-bang 
angular  velocity  controller.  Clark  and  Fierro  [2007]  use  a  random  coverage 
controller,  a  collision  avoidance  controller  and  a  bang-bang  angular  veloc¬ 
ity  controller  to  detect  and  surround  an  oil  spill.  Casbeer  et  al.  [2006]  de¬ 
scribe  an  algorithm  that  allows  Low  Altitude  Short  Endurance  Unmanned 
Vehicles  (LASEUVs)  to  closely  monitor  the  boundary  of  a  fire.  Each  of 
the  LASEUVs  has  an  infrared  camera  and  a  short  range  communication 
device  to  exchange  information  with  other  agents  and  to  download  the  in¬ 
formation  collected  onto  the  base  station.  In  [Zhang  and  Leonard,  2005], 
a  formation  of  four  robots  tracks  at  unitary  speed  the  level  sets  of  a  field. 

Their  relative  position  changes  so  that  they  optimally  measure  the  gradient 
and  they  estimate  the  curvature  of  the  Held  in  the  center  of  the  formation. 

In  [Zhang  and  Leonard,  2007],  a  controller  is  proposed  to  steer  a  group  of 
constant-speed  robots  onto  an  equally-spaced  configuration  along  a  close 
curve. 

6.6  PROOFS 

This  section  presents  the  main  result  of  the  chapter  on  the  correctness  of 
Theorem  6.12.  For  the  sake  of  completeness,  we  review  first  some  notations 
and  main  concepts  for  Input-to-State-Stability  (ISS)  of  discrete-time  systems 
as  introduced  in  [Jiang  and  Wang,  2001,  Angeli,  1999,  Sontag,  2008].  We 
then  make  use  of  these  to  prove  the  result  of  Theorem  6.12. 


6.6.1  Review  of  ISS  concepts 

A  function  7  :  M>o  — >  M>o  is  a  K-function  if  it  is  continuous,  strictly 
increasing,  and  7(0)  =  0.  A  function  /3  :  M>o  x  M>o  — >  M>o  is  a  KC-function 
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if,  for  each  t  £  R>o,  the  function  s  i— >  /3(s,t)  is  a  /C-function,  and  for  each 
s  £  K>o,  t  i— >  /3(s,  t)  is  decreasing  and  /3(s,  t)  — *  0  as  t  — ►  +oo. 

Consider  the  discrete-time  nonlinear  system 

x(£  +  l )  =  f(£,x(£),u(£)),  (6.6.1) 

where  £  takes  values  in  Z>o,  x  takes  values  in  ffi”,  and  u  takes  values  in  Rm. 
We  assume  that  /  :  Z>o  x  Rn  x  Rm  — >  R"  is  continuous.  In  what  follows, 
we  let  1 1 zi 1 1 2,00  =  sup{||u(^)||2  |  £  £  Z>0}  <  +oo. 

Definition  6.14  (Input-to-state  stability).  The  system  (6.6.1)  is  input- 
to-state  stable  (ISS)  if  there  exist  a  /CX-function  (3  and  a  /C-function  7  such 
that,  for  each  initial  condition  xq  £  Mn  at  time  £q  £  Z>o  and  for  each 
bounded  input  u  :  Z>o  — ►  Rm,  the  system  evolution  x  satisfies,  for  each 
£  >  £ 0 , 

||x(£) 1 1 2  <  (3(\\xo\\2,£-  £ 0 )  +  7(11  ^11 2,00) ■  • 

Definition  6.15  (ISS  Lyapunov  function).  A  function  V  :  Rn  — >  R>o 

is  an  ISS-Lyapunov  function  for  system  (6.6.1)  if 

(i)  it  is  continuously  differentiable; 

(ii)  there  exist  /Coo-functions,  07,  a2,  such  that  cki ( | |a:|| 2)  <  V(x)  < 
a2(lkl|2);  and 

(iii)  there  exist  a  /Coo-function  03  and  a  /C-function  a  such  that 

V(f(x,u))  -  V{x)  <  -a3(||x||2)  +  cr(||u||2).  • 

We  refer  to  [Jiang  and  Wang,  2001]  for  a  proof  of  the  following  result. 

Theorem  6.16  (ISS  and  Lyapunov  functions).  System  (6.6.1)  is  ISS 
if  an  only  if  it  admits  an  ISS-Lyapunov  function. 


6.6.2  Proof  of  Theorem  6.12 


Proof.  In  the  interest  of  brevity,  we  prove  only  the  statements  that  pertain 
to  the  boundary  estimation  task  Te_ bndryi  that  is,  to  the  task 


iQa)  ~ 


<  e, 


for  all  a  £  { 1, . . .  ,7%,}  and  *  £  I.  We  refer  to  [Susca  et  al.,  2008]  for  the 
proof  of  the  statements  regarding  the  agent  equidistance  task. 


We  begin  our  analysis  with  the  case  of  a  single  robot,  i.e.,  we  consider 
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the  Estimate  Update  and  Balancing  Law  algorithm,  and  we  first  con¬ 
sider  the  case  of  a  time-invariant  boundary  with  exact  path  representa¬ 
tion  and  with  no  approximations  in  any  computation.  Define  the  short¬ 
hand  Da  =  T>\(qa,  qa+i),  for  a  G  {1, ... ,  ?%,},  and  the  positive  vector  D  = 

(V\ , . . . ,  Pnip )  G  M”q  .  Let  us  characterize  how  the  vector  V  changes  after 
one  application  of  the  state-transition  function  in  Single- Robot  Estimate  Update  Law 
with  counter  nxt.  We  refer  to  an  application  of  the  state-transition  func¬ 
tion  as  a  projection-and-placement  event.  Because  the  boundary  is  time- 
invariant,  the  projection  operation  (performed  by  the  function  perp-proj) 
leaves  the  interpolation  point  qnxt  unchanged.  Furthermore,  as  discussed  in 
Remark  6.7,  the  placement  operation  (performed  by  the  function  cyclic-balance) 
modifies  the  interpolation  point  gnxt-i  so  that 


Rnxt-2 

+  1 

'3 

1' 

Aixt-2 

Rnxt-l 

“  4 

1 

3 

Aixt-i 

where  we  adopt  the  shorthand  =  V\  (q+,  q^+1),  for  a  G  {1, . . 
nxt  G  {1, . . . ,  nip},  define  Anxt  G  MniPxnip  by 


(^tixt  )jk  —  * 


3/4,  if  (j,k)  equals  (nxt  —  l,nxt  —  1)  or  (nxt  — 
1/4,  if  (J,  k)  or  ( j ,  k )  equals  (nxt  —  1,  nxt  —  2), 
5jk,  otherwise 


.  ,nip}.  For 
2,  nxt  —  2), 


and  define  the  undirected  graph  Gnxt  with  vertices  {1, . . . ,  nip}  and  with  the 
single  edge  (nxt  —  l,nxt  —  2).  In  summary,  the  matrix  Anxt  determines  the 
change  of  state  T>+  =  AnxtV  when  a  projection-and-placement  event  takes 
place  with  counter  nxt  and  its  associated  graph  is  Gnxt. 


Because  the  boundary  has  finite  length  and  the  agent  moves  at  lower- 
bounded  speed,  an  infinite  number  of  projection-and-placement  events  take 
place  for  each  interpolation  point.  After  a  re-parametrization  of  time,  let 
l  G  N  denote  the  times  at  which  projection-and- placements  events  take  place 
and  let  nxt  (I!)  G  {1, . . . ,  ri;p}  denote  the  index  corresponding  to  the  event 
taking  place  at  time  i.  At  each  time  l  G  N,  we  write 

V(£)  =  Anxt{e)V(£-l).  (6.6.2) 

Next,  note  that  Anxt(^),  for  i  G  N,  is  a  non-degenerate  sequence  of  symmet¬ 
ric  and  doubly  stochastic  matrices.  Additionally,  note  that  the  undirected 
graph  UT>£G'nxt(T)  is  connected.  Therefore,  by  Theorem  1.62  and  Corol¬ 
lary  1.67,  we  know  that,  for  all  a  G  {1, ... ,  n\p}, 

1  J*ip,  1 

lim  Va(£ )  =  —  )  T>q(0)  =  — (total  pseudodistance  length  of  d Q). 

t^+oo  ri[p  ?Ap 

This  proves  that  the  interpolation  points  become  equally-spaced  along  dQ 
with  respect  to  pseudodistance  and  that  the  boundary  estimation  task  is 
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achieved  in  the  time-invariant  case.  In  other  words,  this  concludes  the  proof 
of  the  boundary  estimation  part  of  statement  (i)  for  a  single  robot  in  The¬ 
orem  6.12. 

Next,  we  consider  the  Cooperative  Estimate  Update  Law  for  net¬ 
works  of  n  >  2  agents.  Each  agent  has  maintains  a  local  copy  of  the 
pseudodistance  vector  and  of  the  interpolation  points  (which  always  take 
value  in  dQ  because  the  boundary  is  time-invariant).  Specifically,  for  i  e 
{l,...,n},  agent  i  maintains  vector  and  interpolation  points  for 
a  S  {1, . . . ,  nip}.  We  define  the  aggregate  pseudodistance  vector  V  as  fol¬ 
lows:  we  let  T>a  equal  the  most-recently  updated  element  of  the  vector 
{T>a\  . . . ,  T>$},  that  is,  the  pseudodistance  between  the  most-recently  up¬ 
dated  interpolation  points  q$  and  g^+1,  for  i  €  {1, . . .  ,n}.  As  before,  after 
a  re-parametrization  of  time,  let  £  £  N  denote  the  times  at  which  projection- 
and-placements  events  take  place  (independently  of  which  agent  i  executes 
the  event)  and  let  nxt(7)  £  {1, . . . ,  nip}  denote  the  index  corresponding  to 
the  event  taking  place  at  time  £  (independently  of  which  agent  i  executes  the 
event).  Note  that,  when  agent  i  updates  the  interpolation  points  <7nxt(^)-i 
and  qnxt(i)~2i  agent  i  then  transmits  the  updated  points  to  its  immediately 
following  agent  i  —  1,  so  that  the  processor  state  of  agent  i  —  1  contains 
the  correct  updated  information.  Also  note  that,  since  the  boundary  is 
time-invariant,  the  updated  interpolation  points  belong  to  the  trajectory 
path^1!  that  agent  i  —  1  maintains  in  its  memory:  this  fact  guarantees 
that  agent  i  —  i  can  properly  perform  the  cyclic-balance  operation.  In 
summary,  equation  (6.6.2)  is  the  correct  model  not  only  for  the  Single- 
Robot  Estimate  Update  Law  but  also  for  the  Cooperative  Estimate 
Update  Law.  This  concludes  the  proof  of  the  boundary  estimation  part  of 
statement  (i)  for  n  >  2  robots  in  Theorem  6.12. 

Let  us  now  relax  the  assumption  on  the  boundary  and  consider  a  time- 
varying  t  e- >  dQ(t );  as  before  we  first  consider  the  case  of  a  single  robot.  We 
assume  that  pseudodistances  between  interpolation  points  along  the  agent 
path  curve  are  computed  exactly.  By  assumption,  the  boundary  dQ  varies 
in  a  continuously  differentiable  way  and  slowly  in  time,  and  therefore,  the 
projection  of  the  interpolation  points  is  well  defined  and  unique.  For  the 
case  of  time- varying  boundary,  the  state  trajectory  in  continuous  time  is 
a  curve  of  the  form  V  :  M>o  — ►  Mnip  defined  as  follows.  Note  that,  in 
general,  the  interpolation  points  lie  outside  dQ  at  almost  all  times,  and 
therefore,  it  makes  no  sense  to  define  T>a  as  the  pseudodistance  from  point 
Qa  to  qa+i  along  dQ.  Rather,  we  give  the  following  definition:  V  is  the 
vector  of  pseudodistances  computed  by  the  robot  along  the  curve  path.  As 
a  consequence,  the  state  trajectory  V  is  constant  for  almost  all  times  and  it 
changes  only  at  projection-and-placement  events.  Specifically,  let  £  denote 
the  time  at  which  a  projection-and-placements  event  takes  place  with  cor- 

268 


“Distributed  Control  of  Robotic  Networks”  by  F.  Bullo,  J.  Cortes  and  S.  Martinez 
Copyright  ©  2006-2008.  Manuscript  under  contract.  This  version:  October  27,  2008 


DCRN  October  27,  2008 


responding  index  nxt(T)  £  {1, . . .  ,  njp}.  At  time  £,  the  Single-Robot  Es¬ 
timate  Update  Law  computes  new  values  for  the  pseudodistances  T>nx t-2 
and  D nxt-i  based  on  processor  state  of  the  agent  (i.e.,  based  on  the  inter¬ 
polation  points  and  the  path  variable);  these  values  are  the  new  values  of 
the  state  T>.  Because  the  boundary  has  upper-bounded  length  uniformly 
in  time  and  because  the  agent  moves  at  constant  speed,  an  infinite  number 
of  projection-and-placement  events  takes  place  for  each  interpolation  point 
and  the  duration  of  time  between  two  consecutive  events  is  uniformly  upper 
bounded.  Given  this  fact,  we  may  let  £  £  N  serve  as  index  for  all  projection- 
and-placement  times.  Clearly,  if  the  boundary  does  not  vary  with  time,  then 
the  transition  V{£)  =  1)  describes  the  projection-and-placement 

event  at  instant  £.  Because  instead  the  boundary  is  time-varying,  we  model 
the  change  in  T>  due  to  the  boundary  motion  by 

W)  =  AixtW  ip{£  -  1)  +  U{£)) ,  (6.6.3) 

where  U(£)  £  Mn‘p  is  a  disturbance.  By  design,  U(£)  is  nonzero  only  on 
components  (nxt(£)  —  1)  and  (nxt(A')  —  2)  of  V.  By  the  assumptions  that 
the  boundary  varies  in  a  continuously  differentiable  way  and  slowly  with 
time,  and  that  its  length  is  upper  bounded,  we  know  that  U  is  vanishing  in 
the  rate  of  change  of  the  boundary.  Finally,  define  the  disagreement  vector 
£  -»•  d(^)  £  span{lnip}±  by 

1T  V(£) 

d{£)  =  V{£)  -  ntp  ;inip.  (6.6.4) 

nip 

From  equation  (6.6.3)  and  from  the  fact  that  Anxt(£)  is  doubly  stochastic, 
the  update  law  for  d  is: 

d(£)  =  AnxtW  A{£  -  1)  +  u(£),  £  £  N,  (6.6.5) 

where  u(^)  =  U{£)  -  A-l^U (£)lnip . 

Equation  (6.6.5)  is  the  correct  update  equation  even  in  the  case  of  n  >  2 
robots  moving  along  a  time- varying  boundary.  This  fact  is  a  consequence  of 
the  two- hop  separation  assumption;  see  Definition  6.10.  Indeed,  as  explained 
in  Remarks  6.11,  the  inequality  nxtl*-1!  <  nxtW  —  2  guarantees  that  each 
robot  can  correctly  perform  each  projection-and-placement  event.  Given  the 
sequence  £  £  N,  define  a  new  sequence  £k  £  N,  for  k  £  N,  as  follows:  set 
£i  =  1,  assume  without  loss  of  generality  that  agent  1  is  the  agent  executing 
the  first  projection-and-placement  event  with  index  nxt(l),  and  let  £k  >  2 
be  the  fc-th  time  when  agent  1  performs  the  projection-and-placement  event 
with  same  index  nxt(l).  Reasoning  about  the  possible  positions  of  all  agents 
at  time  £k-i  and  £k,  one  can  see  that  £k  —  £k- 1  <  2n  •  nlv.  Define  sequence 
Aik  £  Rni PxriiP;  for  fe  e  N,  by  A(l)  =  Anxt(!)  and 

A(£k)  =  Anxt^fc)  •  •  •  Anxt(£fc_i+2)Anxt^fc_i+1),  for  k  >  2. 
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By  Exercise  El. 16,  each  matrix  4(4)  is  doubly  stochastic  and  irreducible, 
because  it  is  the  product  of  doubly  stochastic  matrices  and  because  the 
union  of  the  undirected  graphs  associated  with  the  matrices  defining  4(4) 
is  connected.  By  definition,  equation  (6.6.5)  becomes,  for  fcgN, 

4 

d(4)  =  4(4)d(4-i)  +  ^2  Aixt(4)  ■  ■  ■  Aixttm)u(^) 

£=£k— i+l 

=  4(4)d(4-l)  +  £>  (4  )  Ugtacked  (4  )  j  (6.6.6) 

where  the  vector  ustacked(4)  contains  all  vectors  u(4-i  +  1), . . . ,  u(4),  and 
the  matrix  £>(4)  is  defined  in  the  trivial  corresponding  way. 

Define  V  :  Mnip  — >  M>o  by  V  (x)  =  xTx  and  adopt  this  function  as  a 
candidate  ISS-Lyapunov  function  for  system  (6.6.6).  We  compute 

V (d(4+i))  -  ^(d(4))  =  -d(4)Ti?(4)d(4) 

+  ustacked(^fc)ustacked(4)  +  2ustac]£eci(4)4(4)d(4)> 

where  i?(4)  =  4lp  —  4(4)74(4)-  Because  4(4)  is  doubly-stochastic  and 
irreducible,  we  know  (from  Exercise  El. 5)  that  i?(4)  is  positive  semidefinite 
and  its  simple  eigenvalue  0  is  associated  with  the  eigenvector  lnip.  This 
fact  implies  that  the  quantity  —xT  R(£k)x  is  strictly  negative  for  all  x  (7 
spanjl^p}-1.  To  upper  bound  this  quantity  by  a  negative  number,  we  let 
4S  be  a  generic  element  of  the  set  of  all  the  possible  matrices  4(4);  such 
matrices  are  the  iterated  products  of  at  most  2 n  ■  n;p  matrices  of  the  form 
4nxt)  where  each  matrix  4nxt,  nxt  E  {1, . . . ,  ri;p},  appears  at  least  once. 
Define  the  set  of  nonzero  eigenvalues  of  4S  by 

Ss  =  {A  e  M  |  det  (A 4ip  -  (4^4S  -  Inj)  =  0}  \  {0} 

and  define  the  eigenvalue  with  smallest  magnitude  among  all  matrices  by 
r  =  mins  min{|A  |  A  E  Ss}.  Note  that  r  >  0  because  we  are  considering  a 
finite  set  of  matrices.  We  can  then  write 

E(d(4  +  1))  -  E(d(4))  <  -a3(||d(4)||)  +  cr ( 1 1 ustacked (4) ||), 

where  a3(||d||)  =  §T||d|[2  and  o-(||ustacked||)  =  (=  +  l)||ustacked||2-  By  Def¬ 
inition  6.15  the  system  described  by  (6.6.6)  is  input-to-state  stable.  The 
input-to-state  stability  implies  the  existence  of  a  positive  e  as  in  the  bound¬ 
ary  estimation  part  of  statement  (ii)  for  n  >  2  robots  in  Theorem  6.12.  ■ 


6.7  EXERCISES 

E6.1  (Alternative  expression  of  the  symmetric  difference).  Show  that  the  sym¬ 
metric  difference  5s  between  two  compact  bodies  C,  B  CM3  can  be  alternatively 
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expressed  as 

5s(C,B)=p(C\B)  +  p(B\C). 

Hint:  Use  the  expression  C  =  (C  \  B)  U  (C  (~l  B)  and  B  =  (B  \  C)  U  (C  (~l  B) . 

E6.2  (Characterization  of  critical  inscribed  polygons  for  the  symmetric  dif¬ 
ference).  Prove  Lemma  6.4.  Also,  show  that  not  all  critical  configurations  are 
optimal.  Specifically,  consider  the  convex  body  and  the  gray  inscribed  triangle 
depicted  in  Figure  E6.1(a).  Show  that  the  gray  triangle  is  a  saddle  configuration 


Figure  E6.1  Illustration  of  the  fact  that  polygons  satisfying  (6.2.1)  may  not  be  optimal 
for  5s. 


for  5s  by  establishing  that  modifications  of  the  triangle  as  in  Figure  E6.1(b)  de¬ 
crease  its  area  (and  hence  increase  5s),  whereas  modifications  of  the  gray  triangle 
as  in  Figure  E6.1(c)  increase  its  area  (and  hence  decrease  5s). 

E6.3  (The  “n-bugs  problem”  and  cyclic  interactions:  continued).  Consider  n 
robots  at  counterclockwise-ordered  positions  9\, ...  ,9n  following  the  cyclic  bal¬ 
ancing  system  described  in  Exercise  El. 29  with  parameter  k  =  1/4: 

ei(i  +  i)  =  ±ei+1(i)  +  ^9i(£)  +  ^ei-1(£),  ie  z>0. 

Show  that: 

distcc  (9i-1(£),9i(£+  1))  =  |  distcc  (9i-i(£),  »<(*))  +  i  distcc  (&(£),  9i+1{£)). 

E6.4  (ISS  properties  of  averaging  algorithms  with  inputs  and  outputs).  This 
is  a  guided  exercise  to  prove  some  of  the  ISS  properties  of  averaging  algorithms 
with  inputs  and  outputs.  An  averaging  algorithm  with  inputs  associated  to  a 
sequence  of  stochastic  matrices  {F(£)  \  l  £  Z>o}  C  R71’*71,  a  sequence  of  input 
gains  {D(£)  \  £  £  Z>oj  C  R”x)c,  and  a  sequence  of  disturbances  u  :  Z>o  — >  R*/  is 
the  discrete-time  dynamical  system 

x{£  +  1)  =  F{£)x(£)  +  D{£)u{£),  £  £  Z>0.  (E6.1) 


A  natural  question  to  ask  is  how  the  evolution  of  the  trajectory  x  is  affected  by 
the  noise  u.  Let  us  address  this  in  the  following.  Define  the  matrix 


T  -1  0 

0  1  -1 


0  ' 
0 


0  ...  0  1  -1 
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Note  that,  with  the  notation  of  Exercise  El. 7,  one  can  write 


Define  the  following  output  for  the  dynamical  system  (E6.1): 


(Xl  -  X2  \ 

:  €  R"_1. 

Xji—  i  xnJ 

This  output  can  be  thought  of  as  an  error  signal  that  quantifies  the  disagreement 
among  the  components  of  the  state.  Now,  consider  the  change  of  variables 


where  awe  €  R  is  the  average  of  the  components  of  x. 

Verify  that  system  (E6.1)  reads  in  the  new  variable  z  as 

z(£  +  1)  =  TF^T-1  z{£)  +  TD{£)u{£). 

The  previous  result  states  in  careful  words  the  following  intuition.  Because  of 
the  definition  of  a  and  of  the  special  structure  of  {F(£)  \  i  £  Z>o},  the  variable 
*ave  plays  no  role  in  the  evolution  of  yeTT .  Accordingly,  we  define  the  error  system 
by 

Ve  rr(£  +  1)  =  Terr  (7)  2/err  {£)  +  Derr(£)U(£),  (E6.2) 

and  the  average  system  by 

Zave(^  +  1)  =  Zave(T)  +  CeII(£)yeII  (£)  +  Dave(£)u(^),  (E6.3) 

for  De „(£)  =  PD{£),  and  Dave(£)  =  £l lD{f). 

Assume  now  that 

(a)  the  sequence  {F(£)  \  £  £  Z>o}  is  a  non-degenerate  sequence  of  stochastic 
matrices; 

(b)  for  £  £  Z>o,  let  G(£)  be  the  unweighted  digraph  associated  with  F(£). 
There  exists  a  duration  (5  £  N  such  that,  for  all  £  £  Z>o  the  digraph 
G(£  +  1)  U  . . .  U  G(£  +  5)  contains  a  globally  reachable  node;  and 

(c)  the  induced  norm  of  {D(£)  \  £  £  Z>o},  for  £  £  Z>o,  is  uniformly  bounded. 
Prove  that,  under  assumptions  (a),  (b)  and  (c)  on  the  averaging  system  with 

inputs,  the  following  equivalent  statements  hold: 

(i)  The  system  (E6.1)  with  output  yelI  is  IOS;  and 

(ii)  the  error  system  (E6.2)  is  ISS. 
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